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PREFACE 



Paul Halmos famously remarked in his beautiful Hilhert Space Problem Book [22] that "The 
only way to learn mathematics is to do mathematics." Halmos is certainly not alone in this belief. 
The current set of notes is an activity-oriented companion to the study of real analysis. It is 
intended as a pedagogical companion for the beginner, an introduction to some of the main ideas in 
real analysis, a compendium of problems I think are useful in learning the subject, and an annotated 
reading/reference list. 

The great majority of the results in beginning real analysis are straightforward and can be 
verified by the thoughtful student. Indeed, that is the main point of these notes — to convince the 
beginner that the subject is accessible. In the material that follows there are numerous indicators 
that suggest activity on the part of the reader: words such as "proposition", "example", "exercise", 
and "corollary" , if not followed by a proof or a reference to a proof, are invitations to verify the 
assertions made. Of course, the proofs of some theorems, the Hahn-Banach theorem for example, 
are, in my opinion, too hard for the average student to (re)invent. In such cases when I have no 
improvements to offer to the standard proofs, instead of repeating them I give references. 

These notes were written for a year long course in real analysis for seniors and first year graduate 
students at Portland State University. During the year students are asked to choose, in addition to 
these notes, other sources of information for the course, either printed texts or online documents, 
which suit their individual learning styles. As a result the material that is intended to be covered 
during the Fall quarter, the first 10-15 chapters, is relatively complete. After that, when students 
have found other sources they like, the notes become sketchier. 

I have made some choices that may seem unusual. I have emphasized the role played by order 
relations in function spaces by trying to make explicit the vector lattice structure of many important 
examples. This material usually seems to have some "poor relation" status in most real analysis 
texts. To highlight fundamental concepts and avoid, on a first acquaintance, distracting technical 
details I have treated compact spaces more thoroughly than locally compact ones and real measures 
more thoroughly than extended real valued ones. 

There are of course a number of advantages and disadvantages in consigning a document to 
electronic life. One advantage is the rapidity with which links implement cross-references. Hunting 
about in a book for lemma 3.14-23 can be time-consuming (especially when an author engages in 
the entirely logical but utterly infuriating practice of numbering lemmas, propositions, theorems, 
corollaries, and so on, separately). A perhaps more substantial advantage is the ability to correct 
errors, add missing bits, clarify opaque arguments, and remedy infelicities of style in a timely 
fashion. The correlative disadvantage is that a reader returning to the web page after a short 
time may find everything (pages, definitions, theorems, sections) numbered differently. (ErgXis an 
amazing tool.) I will change the date on the title page to inform the reader of the date of the last 
nontrivial update (that is, one that affects numbers or cross-references). 

The most serious disadvantage of electronic life is impermanence. In most cases when a web 
page vanishes so, for all practical purposes, does the information it contains. For this reason (and 
the fact that I want this material to be freely available to anyone who wants it) I am making use of 
a "Share Alike" license from Creative Commons. It is my hope that anyone who finds this material 
useful will correct what is wrong, add what is missing, and improve what is clumsy. For more 
information on creative commons licenses see http://creativecommons.org/. Concerning the text 
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itself, please send corrections, suggestions, complaints, and all other comments to the author at 

erdman@pdx.edu 
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Freiktur Fonts 

In these notes Fraktur fonts arc used (most often for families of sets and families of operators). 
Below are the Roman equivalents for each letter. When writing longhand or presenting material 
on a blackboard it is usually best to substitute script English letters. 
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CHAPTER 1 



SETS 

1.1. Set Notation 

Everything in these notes is defined ultimately in terms of two primitive (some say, undefined) 
concepts: set and set membership. We assume that these are already familiar. In particular, we 
take it as understood that individual elements (or members, or points) can be regarded collectively 
as a single set (or family, or collection). It is occasionally useful to have all these different names. 
Many find a discussion concerning "a collection of families of sets" somewhat less confusing than 
trying to keep track of the identities of the members of the cast in "a set of sets of sets" . 

To indicate that x belongs to a set A we write x £ A; to indicate that it does not belong to A 
we write x ^ A. Synonymous with "x belongs to A" are "x is an element of A" , "x is a member 
of j4" , and "x is a point in A" . 

We specify a set by listing its members between braces (for example, {1, 2, 3, 4, 5} is the set of 
the first five natural numbers), by listing some of its members between braces with an ellipsis (three 
dots) indicating the missing members (for example, {1, 2, 3, ... } is the set of all natural numbers), 
or by writing {x: P{x)} where P{x) specifies what property the variable x must satisfy in order to 
be included in the set (for example, {x: < x < 1} is the closed unit interval [0, 1]). 

1.1.1. CAUTION. Be aware that notational conventions for sets vary according to context. For 
example, if the statement of a theorem starts, "Let {si, S2, ■ ■ ■ , s„} be a set," the author most often 
intends the set to contain n distinct members. But if in a proof we read, "Let {ri,r2, . . . ,r„} be 
the roots of the n^^ degree polynomial p" , we do not conclude that the elements of the set are 
necessarily unique. (An n*^ degree polynomial has n roots only when we count multiplicity.) 

In a similar vein I adopt a convention in these notes: If I write, "Let a, b, and c be points in a 
topological space . . . ," I am not assuming that they are distinct, but if I write, "Let a, b, and c be 
three points in a topological space . . . ," I am assuming them to be distinct. 

1.1.2. CAUTION. In attempting to prove a theorem which has as a hypothesis "Let 5 be a 
set" do not include in your proof something like "Suppose S = {si, S2, ■ ■ ■ , Sn}" or "Suppose 
S = {si, S2, ■ ■ ■ }". In the first case you are tacitly assuming that S is finite and in the second that 
it is countable. Neither is justified by the hypothesis. 

1.2. Families of Sets 

In these notes we devote much attention to such objects as cr-algebras and topologies, which 
are families of sets (that is, sets of sets). Ordinarily we will denote a family of sets by a single 
letter, as in "Let 6 be a family of sets." 

1.2.1. CAUTION. A single letter & (an S in fraktur font) is an acceptable symbol in printed 
documents. Don't try to imitate it in hand-written work or on the blackboard. Use script letters 
instead. 

Later in section 7.2 of chapter 7 we will consider a closely related, but quite different, concept 
of an indexed fam,ily of sets. 

1.2.2. Example. Let (3 be the family {{x G M: |x — a| < 6} : a, 6 G M} and T be the family 
{{x E M:a < x < b}: a,b G M}. Then S = T. Characterize this family of sets in ordinary 
language (without the use of symbols). 
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1.3. Subsets 

1.3.1. Definition. Let S and T be sets. We say that 5 is a subset of T and write S* C T (or 
T D S) if every member of S belongs to T. If 5 C T we also say that S is contained in T or that 
T CONTAINS S. If 5 C T but S T, then we say that 5 is a proper subset of T (or that S is 
PROPERLY CONTAINED IN T, or that T PROPERLY CONTAINS S) and write S ^T. 

1.3.2. Example. The empty set (or null set), which is denoted by 0, is defined to be the set 
which has no elements. (Or, if you like, define it to be {.x: x ^ x}.) It is regarded as a subset of 
every set, so that C S is always true. (Note: "0" is a letter of the Danish alphabet, not the Greek 
letter "phi".) 

1.3.3. Example. If 5 is a set, then the POWER SET of S, which we denote by ^{S), is the set of 
all subsets of S. 

1.3.4. Remark. In these notes equality is used in the sense of identity. We write x = y to indicate 
that X and y are two names for the same object. For example, we write {5, 7, a, b} = {b, 7, 5, a} 
because sets are entirely determined by their members and the indicated sets have exactly the same 
members. Likewise we write 0.5 = 1/2 = 3/6 = l/y/i because 0.5, 1/2, 3/6, and l/\/4 are different 
names for the same real number. You have probably encountered other uses of the term equality. 
In many high school geometry texts, for example, one finds statements to the effect that a triangle 
is isosceles if it has two equal sides (or two equal angles) . What is meant of course is that a triangle 
is isosceles if it has two sides of equal length (or two angles of equal angular measure). Wc also 
make occasional use of the symbol := to indicate equality by definition. Thus when we write a := b 
we are giving a new name a to an object b with which we are presumably already familiar. 

1.4. Unions and Intersections 

1.4.1. Definition. If 2t is a family of subsets of a set S, then we define the UNION of the family 
21 to be the set of all x belonging to S such that x G A for at least one set A in 21. We denote the 
union of the family 21 by |J 21 (or by [j^^f^A, or by U{^- ^ ^ 21}). Thus x e ii and only if 
there exists ^ G 21 such that x G A. 

If 21 is a finite family of sets Ai, . . . , An, then we may write IJfc=i ^fc or U ^2 U • • • U An for 
|J2t. Similarly, if 21 is a countably infinite family of sets Ai, A2, . . . , then we may write Ufc^i 
or ^1 U ^2 U . . . for U 21. 

In a similar fashion, if 21 is a family of subsets of a set S, then we define the intersection of 
the family 21 to be the set of all x belonging to S such that a; G A for every set A in 21. We denote 
the intersection of the family 2t by QSl (or by PlAea^' ^y r\{A: A G 21}). Thus x G fl^l if and 
only if X G ^ for every ^ G 21. 

If 21 is a finite family of sets Ai, . . . , An, then we may write 0^=1 ^fc or n ^2 H • • • H An for 
f] 2t. Similarly, if 21 is a countably infinite family of sets Ai, A2, . . . , then we may write PlfcLi 
or Ai n ^2 n . . . for n 21. 

The next two propositions are the familiar facts that union distributes over intersection and 

intersection distributes over union. 

1.4.2. Proposition. Let 21 and *B be nonempty families of sets and ^ G 21. Then 

(a) Au{f]^) = f]{AuB: B G 53}, and 

(b) (n2l)u(n®) =n{^UB: Ag21 andSG©}. 

1.4.3. Proposition. Let 2t and OS be nonempty families of sets and A G 21. Then 

(a) A n ( U ^ ) = U{^ n S : 5 G «}, and 

(b) (U2l)n(U^) =U{^nS: ^g21 and5G*B}. 

1.4.4. Definition. Sets S and T are said to be disjoint if SCiT = 0. More generally, a family S of 
sets is a disjoint family (or a pairwise disjoint or a mutually disjoint family) if 5 n T = 
whenever S and T are distinct (that is, not equal) sets which belong to S. 
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1.4.5. CAUTION. Let (3 be a family of sets. Do not confuse the following two statements. 

(a) S is a (pairwise) disjoint family. 

(b) ne = 0. 

Certainly, if (3 contains more than a single set, then (a) implies (b). But if 6 contains three or 
more sets the converse need not hold. For example, let S = {0,1}, T = {3,4}, U = {0,2}, and 
6 = {S,T,U}. Then & is not a disjoint family (because S CiU is nonempty), but P| © = 0. 

1.4.6. Definition. Let A and B be sets. When A and B are disjoint we will often use the notation 
A^tlB instead AuB (to emphasize the disjointness of A and B). When C = A^tlB we say that C 
is the DISJOINT UNION of A and B. Similarly we frequently choose to write the union of a pairwise 
disjoint family 21 of sets as 1+J 21. And the notation l+J_;ygyv may be used to denote the union of a 
pairwise disjoint indexed family {Ax : A G A} of sets. When C = (+J 21 or C = we say that 
C is the DISJOINT UNION of the appropriate family. (Indexed families of sets are discussed in more 
detail in section 7.2.) 

1.4.7. Proposition. Let Ai, A2, As, . . . be sets. Then there exist sets Bi, B2, B^, ■ ■ ■ such that 

n n 00 00 

U ^fc = l±J -Bfc for each n G N and, consequently, \J A^ = l+l 

k=l k=l k=l k=l 

1.4.8. Definition. A family 6 of subsets of a set S COVERS (or is a COVER FOR, or is a covering 
for) a set a if a C y 6. 

1.4.9. Exercise. Find a family of open intervals which covers the set N of natural numbers and 
has the property that the sum of the lengths of the intervals is 1. Hint. Yl'h=i — ^■ 

1.4.10. CAUTION. It is a good idea to be alert to some differing attitudes towards "the" empty 
set. In particular, is the empty family of subsets of a nonempty set S the same thing as "the" 
empty set? Or does it depend on SI There appears to be general agreement that the union of the 
empty family of subsets of S is empty. (For an element x to belong to the union of the empty family 
there would have to be a member of that family to which x belongs, which cannot be.) But what 
about the intersection of the empty family? For x to belong to that intersection it must belong to 
each member of the empty family, which it vacuously does because there are no members of the 
empty family. So, some argue, every x belongs to the intersection of the empty family. But every 
X in what universe of discourse? Many take the view that since we are talking about subsets of a 
fixed set 5, only those x in 5 qualify. Thus P| = 5 when is the empty family of subsets of S. 
This is all quite reasonable, but it does have some odd consequences. One is that there are lots of 
different empty families of sets (since they have differing intersections), and this runs counter to 
our usual agreement that a set is entirely determined by its elements. (Perhaps we should be more 
modest and assert only that nonempty sets are determined by their elements.) Also it is rather 
strange to have the union of a family be a proper subset of its intersection. 

1.5. Complements 

We ordinarily regard the sets with which we work in a particular context as being subsets 
of some appropriate universal set. Think of "universal" in the sense of delimiting a universe of 
discourse rather than being all- encompassing. For each set A contained in this universal set 5 we 
define the complement of A , denoted by A^, to be the set of all members of S which do not 
belong to A. That is, x ^ A^ \i and only \i x ^ S and x ^ A. 

1.5.1. Example. Let A be the closed interval (—00, 3]. If nothing else is specified, we think of 
this interval as being a subset of the real line M (our universal set). Thus A'^ is the set of all x in 
M such that x is not less than or equal to 3. That is, A^ is the interval (3, 00). 

1.5.2. Example. Let A be the set of all points (x,y) in the plane such that x > and y > 0. 
Then A'^ is the set of all points (x, y) in the plane such that either x < or y < 0. That is, 

A' = {(x,y): X < 0}U{(x,2/): y < 0}. 
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The two following propositions are De Morgan's laws for sets. They say that the comple- 
ment of the union of a family is the intersection of the complements (proposition 1.5.3), and the 
complement of the intersection of a family is the union of the complements (proposition 1.5.4). As 
you may expect, they are obtained by translating into the language of sets the facts of logic which 
go under the same name. 

1.5.3. Proposition. Let & be a family of subsets of a set S. Then 

{U&y = n{A^: A G &}. 

1.5.4. Proposition. Let & be a family of subsets of a set S. Then 

{ney = U{A'': A£6}. 

1.5.5. Definition. If A and B are subsets of a set S, we define the complement of B relative 
TO A, (or the set difference of A and B), denoted hy A \ B, to be the set of all x G 5" which 
belong to A but not to B. That is, 

A\B := AnB". 
The operation \ is called set subtraction. 

1.5.6. Example. Let A = [0, 5] and B = [3, 10]. Then A\B = [0,3). 

It is an elementary but frequently useful fact that the union of two sets can be written as a 
disjoint union (that is, the union of two disjoint sets). 

1.5.7. Proposition. Let A and B be subsets of a set S. Then A\B and B are disjoint sets whose 
union is AU B. 

1.5.8. Exercise. Let ^ = {x G M: x > 5}, B = (-4,8), C = (-2,12], D = [10,15], and 
e = {A^,B,C\D}. 

(a) Find U©- 

(b) Find n©- 

1.5.9. Proposition. If A, B, and C are subsets of a set S, then 

{Ar\B)\c = (A\C)r\{B\c) = Ar\{B\c) = {Ar\B)\{Ar\C). 

1.6. Symmetric Difference 

1.6.1. Definition. Let A and B be subsets of a set S. Then the symmetric difference A/\B 
of A and B is defined by 

AAB ■.= {A\B)[J{B\A). 

1.6.2. Proposition. Let A and B be subsets of a set S. Then A A B = {AU B)\{An B). 

1.6.3. Proposition. If A, B, and C are subsets of a set S, then A A C C (^ A B) U (5 A C). 

1.6.4. Proposition. Let A, B, and C be subsets of a set S. Then 

(a) A A (B A C) = (^ A 5) A C; 

(b) AABr^C = Ar\Cl\Br\C; 

(c) AAS = A^; 

(d) A A = A; and 

(e) AAA = %. 
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1.7. Notation for Sets of Numbers 

Here is a list of fairly standard notations for some sets of numbers which occur frequently 
these notes: 

C is the set of eomplcx numbers 
M is the set of real numbers 

M" is the set of all n-tuples (ri, r2, . . . , r„) of real numbers 

M"*" = {x eM.: X > 0}, the positive real numbers 

Q is the set of rational numbers 

Q"*" = e Q: X > 0}, the positive rational numbers 

Z is the set of integers 

Z"*" = {0, 1,2,...}, the positive integers 

N = {1, 2, 3, . . . }, the set of natural numbers 

N„ = {1,2,3,..., n} the first n natural numbers 



[a,b] 


= G M: 


a < X < b} 


[a,b) 


= {x e M: 


a < X < b} 


{a,b] 


= {x G M: 


a < X < b} 


{a,b) 


= {x e M: 


a < X < b} 


[a, oo 


) = {x G K 


: a < x} 


(a, oo 


) = {x G 1 


I: a < x} 


(— oo, 


6] = {x G 


R: X <b} 


(— oo, 


6) = {x G 


R: X < b} 



D = {(x, y) G : .x^ + < 1}, the open unit disc 
T = §^ = {(x, y) G : x^ + = 1}, the unit circle 



CHAPTER 2 



FUNCTIONS 

2.1. Cartesian Products 

Ordered pairs are familiar objects. They are used among other things for coordinates of points 
in the plane. In the first sentence of chapter 1 it was promised that all subsequent mathematical 
objects would be defined in terms of sets. So here just for the record is a formal definition of 
"ordered pair" . 

2.1.1. Definition. Let x and y be elements of arbitrary sets. Then the ordered pair {x,y) is 
defined to be {{x, y}, {x}}. This definition reflects our intuitive attitude: an ordered pair is a set 
{x,y} with one of the elements, here x, designated as being "first". Thus we specify two things: 
{x, y} and {x}. 

Ordered pairs have only one interesting property: two of them are equal if and only if both their 
first coordinates and their second coordinates are equal. As you will discover in the next exercise, 
this fact follows easily from the definition. 

2.1.2. Proposition. Let x, y, u, and v he elements of arbitrary sets. Then {x,y) = {u,v) if and 
only if X = u and y = v. 

Hint for proof . Do not assume that the set{3;,y} has two elements. If x = y, then {x,y} has 
only one element. 

In section 8.3 we define the general notion of product. For the moment we concentrate on the 
most important of these — the Cartesian product of sets. 

2.1.3. Definition. Let S and T be sets. The Cartesian product of S and T, denoted hy S xT, 
is defined to he {{x,y): x £ S and y G T}. The set 5 x 5 is often denoted by 5^. 

In section 8.3 we define the general notion of product. For the moment we concentrate on the 
most important of these — the Cartesian product of sets. 

2.1.4. Proposition. Let S, T, U , and V he sets. Then 

(a) (5 X r) n ([/ X y) = (5 n [/) X (T n v); 

(b) (5 X r) U ([/ X y) C (5 U [/) X (T U V); and 

(c) equality need not hold in (h). 

2.2. Relations 

2.2.1. Definition. A relation from a set S" to a set T is a subset of the Cartesian product S xT. 
A relation from the set S to itself is often called a relation on or a relation among memhers of S. 

There is a notational oddity concerning relations. To indicate that an ordered pair (a, b) belongs 
to a relation i? C S" x T, we almost always write something like aRb rather than (o, b) G i?, which 
we would expect from the definition. For example, the relation "less than" is a relation on the real 
numbers. Technically then, since < is a subset of M x M, we could (correctly) write expressions 
such as (3, 7) G<. Of course we don't. We write 3 < 7 instead. And we say, "3 is less than 7", not 
"the pair (3, 7) belongs to the relation less than". This is simply a matter of convention; it has no 
mathematical or logical content. 
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In definition 2.2.1 we defined a relation i? on a set 5 as a subset of the Cartesian product S x S. 
We now define some conditions which a relation may satisfy. 

2.2.2. Definition. Let R he a relation on a set nonempty S. We will, as usual, write aRb for 
(a, b) G R. The relation 

(a) R is REFLEXIVE if aRa holds for every o G 5; 

(b) R is TRANSITIVE if aRc holds whenever both aRb and bRc do; 

(c) R is SYMMETRIC if aRb implies bRa; and 

(d) R is ANTISYMMETRIC if aRb and bRa together imply a = b 

Notice that the relation of equality on elements of a set is both symmetric and antisymmetric. 

2.3. Functions 

Functions are familiar from beginning calculus. Informally, a function consists of a pair of sets 
and a "rule" which associates with each member of the first set (the domain) one and only one 
member of the second (the codomain). While this informal "definition" is certainly adequate for 
most purposes and seldom leads to any misunderstanding, it is nevertheless sometimes useful to 
have a more precise formulation. This is accomplished by defining a function to be a special type 
of relation between two sets. 

2.3.1. Definition. A function / is an ordered triple {S,T, G) where S and T are sets and G is 
a subset of 5 x T satisfying: 

(1) for each s G S" there is a t G T such that (s, t) G G, and 

(2) if (s,ti) and (s,t2) belong to G, then ti = t2- 

In this situation we say that / is a function from S into T (or that / maps S into T) and write 
f:S^T. The set S is the domain (or the input space) of /. The set T is the codomain 
(or TARGET space, or the output space) of /. And the relation G is the graph of /. In order 
to avoid explicit reference to the graph G it is usual to replace the expression "(x,y) G G" by 
"y = fi^)"] the element f{x) is the image of x under /. In these notes (but not everywhere!) the 
words "transformation" , "map" , and "mapping" are synonymous with "function" . The domain of 
/ is denoted by dom/. 

2.3.2. Notation. If S and T are sets, then the family of all functions with domain S and codomain 
T is denoted by T{S, T). Since real valued functions play such a large role in real analysis, we will 
shorten the notation for the family of real valued functions on a set S from 7^(5, M) to J-{S). 

2.3.3. Example. There are many ways of specifying a function. Statements (1)~(4) below define 
exactly the same function. We will use these (and other similar) notations interchangeably. 

(1) For each real number x we let f{x) = x^. 

(2) Let / = (S, T, G) where 5 = T = M and G = {(x, x^) : x G M}. 

(3) Let / : M ^ M be defined by /(x) = x^. 

(4) Consider the function /: M — > R: x x^. 

2.3.4. Convention. A real valued function is a function whose codomain lies in M. A function 
of a real variable is a function whose domain is contained in M. Some real valued functions of a 
real variable may be specified simply by writing down a formula. When the domain and codomain 
are not specified, the understanding is that the domain of the function is the largest set of real 
numbers for which the formula makes sense and the codomain is taken to be M. 

2.3.5. Example. Let f{x) = (x^ + x)^^. Since this formula is meaningful for all real numbers 
except —1 and 0, we conclude that the domain of / is M \ {—1, 0}. 
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2.4. Images and Inverse Images 

2.4.1. Definition. Iff: S^TandACS, then f^{A), the image of A under /, is {/(x): x G A}. 
It is common practice to write f{A) for f^[A). The set f^(S) is the range (or image) of /; 
usually we write ran / for f^{S). 

2.4.2. Definition. Let /: 5 — > T and B C T. Then f^{B), the inverse image of B under /, 
is {a; G S: f{x) G S}. In many texts f'^{B) is denoted by f~^{B). This notation may cause 
confusion by suggesting that a function always has an inverse. 

X 

2.4.3. Exercise. Let fix) = . 

1 — X 

(a) Find /^([0,a]) for a > 0. 

(b) Find 

2.4.4. Exercise. Let 

. , , I X + 1, for X < 1 

l8 + 2x — x^, for X > 1. 

Let A = (-2, 3) and B = [0, 1]. Find f^{A) and f^{B). 

2.5. Composition of Functions 

2.5.1. Definition. Let f : S ^ T and g: T ^ U. The composite of g and /, denoted by 5 o /, 
is the function taking S to U defined by 

(5o/)(x)=5(/(x)) 

for all X in 5. The operation o is COMPOSITION. We again make a special convention for real valued 
functions of a real variable: The domain of 5 o / is the set of all x in M for which the expression 
g{f{x)) makes sense. 

2.5.2. Proposition. Composition of functions is associative but not necessarily commutative. That 
is, if f : S ^ T , g: T ^ U , and h: U ^ V , then h o [g o f) = [h o g) o f . Also, it need not be true 
that fog and g o f are equal. 

2.5.3. Exercise. For real valued functions of a real variable does composition distribute over 
addition? That is, is it always true that f o {g + h) = {f o g) + (f o h) (assuming that both sides 
are defined)? How about {f + g) o h = {f o h) + {g o h)7 

2.5.4. Proposition. If f: S ^ T and g: T ^ U, then 

(a) (g o f)^(A) = g^{f^{A)) for every A C S; and 

(b) (g o f)^{B) = f^{g^{B)) for every BCU. 

2.6. The Identity Function 

2.6.1. Definition. If 5 is a set one member of the family T{S,S) of functions mapping S into 
itself is particularly noteworthy, the IDENTITY FUNCTION on S. It is defined by 

Is: S ^ S: X X. 

Depending on context we may denote the identity function on a set S hy Is, or 7, or id^, or id. 
It is easy to see that the identity function is characterized algebraically by the conditions: 

if/:i?^5, then 750/ = / 

and 

iig: S^T, then go Is = g. 
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2.6.2. Definition. More general than the identity function are the inclusion maps. li A C. S, then 
the INCLUSION MAP taking A into S is defined by 

6^ ^ . .A y S . X I y x . 

When no confusion is likely to result, we abbreviate to i. Notice that ig g is just the identity 
map Is- 



2.7. Diagrams 

It is frequently useful to think of functions as arrows in diagrams. For example, the situation 
j : R ^ U, f : R ^ S, k: S ^ T, h: U ^ T may be represented by the following diagram. 




The diagram is said to COMMUTE (or to be a COMMUTATIVE diagram) iiho j = kQ f. 
Diagrams need not be rectangular. For instance. 




is a commutative diagram \i g = ko f. 

2.7.1. Example. Here is a diagrammatic way of stating the associative law for composition of 
functions. If the triangles in the diagram 




commute, then so does the rectangle. 



2.8. Some Special Functions 

2.8.1. Definition. If /: 5 -)• T and AQS, then the RESTRICTION of / to A, denoted by /|^, is 
the function f ° i-ao- That is, it is the mapping from A into T whose value at each x in ^ is f {x). 



S 
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2.8.2. Definition. Suppose that g: A-^T and ^4 C 5. A function /: 5 — )- T is an extension of 
g to S if = g, tiiat is, if tlie diagram 




commutes. 

There are many ways of combining functions. Composition we have already encountered (sec- 
tion 2.5). Here are two more ways. 

2.8.3. Notation. Suppose /: i? — >• T and g: S ^ U are mappings between sets. Then f x g is 
the function mapping R x S into T x U defined by 

{fxg)ir,s) = {f{r),g{s)) 

for all r G i? and s £ S. 

2.8.4. Notation. Suppose f:T^U and g: T ^ V are mappings between sets. Then {f,g) is 
the function mapping T into U x V defined by 

{f,9m = {f{t),g{t)) 

for ah t e T. 

2.8.5. Example. Let T = [0,2tt], f{t) = cost, and g{t) = s'mt for ah t G T. Then {f,g) is the 
usual (counterclockwise) parametrization of the unit circle in E?. 

2.8.6. Definition. A function f : S ^ T between sets is a constant function if ran / contains 
exactly one element of T. It is conventional to use the value of the constant function as its name. 
Thus, for example, the constant function on a set S whose value at each s G 5 is 2 is denoted by 2. 
(Thus of course the notation fails to tell us the domain of the function; we must deduce that from 
context.) 

2.8.7. Definition. If A is a subset of a nonempty set S, we define : 5* — )• M, the characteristic 
FUNCTION of A by 

Jl, iixGA 
I 0, otherwise 



2.9. Injections, Surjections, and Bijections 

2.9.1. Definition. A function / is injective (or one-to-ONe) if x = y whenever f{x) = f{y). 
That is, / is injective if no two distinct elements in its domain have the same image. For a real 
valued function of a real variable this condition may be interpreted graphically: A function is one- 
to-one if and only if each horizontal line intersects the graph of the function at most once. An 
injective map is called an injection. 

2.9.2. Exercise. Find an injective mapping from {x £ Q: x > 0} into N. 

2x — 5 

2.9.3. Example. The function x i— )• is injective. 

3x + 4 

2.9.4. Example. The function x i— )• 2x^ — x — 15 is not injective. 

2.9.5. Definition. A function is surjective (or onto) if its range is equal to its codomain. 

2.9.6. Exercise. Find a surjection (that is, a surjective map) from [0, 1] onto [0,oo). 
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2.9.7. Definition. A function is buective if it is both injective and surjective. A bijective map 
is called a bijection or a one-to-ONE correspondence. 

2.9.8. Exercise. Give an explicit formula for a bijection between 7L and N. 

2.9.9. Exercise. Give an explicit formula for a bijection between M and the open interval (0, 1). 

2.9.10. Exercise. Give a formula for a bijection between the interval [0, 1) and the unit circle 
+ = 1 in the plane. 

2.9.11. Proposition. Let f : S ^ T and g: T ^ U . 

(a) If f and g are injective, so is g o f. 

(b) If f and g are surjective, so is g o f. 

(c) If f and g are bijective, so is g o f . 

It is important for us to know how the image and the inverse image /■*" of a function / 
behave with respect to unions, intersections, and complements of sets. The basic facts are given in 
the next six propositions. Although these results are quite elementary, they are quite useful. 

2.9.12. Proposition. Let f : S ^ T and B C T. Then 

(a) f^if^iB))QB; 

(b) equality need not hold in (a); but 

(c) equality does hold in (a) if f is surjective. 

2.9.13. Proposition. Let f : S ^ T and A C S . Then 

(a) AC /-(/-(A)); 

(b) equality need not hold in (a); but 

(c) equality does hold in (a) if f is injective. 

2.9.14. Exercise. Prove the converse of part (c) of proposition 2.9.13. That is, show that if 
/: S" — )• T and (A)) = A for all AOS, then / is injective. Hint. Suppose f{x) = f{y). Let 
A = {x}. Show that y € f^{f^{A)). 

2.9.15. Proposition. Let f : S ^ T and D <Z T . Then 

f^iD') = {f^{D)y. 

2.9.16. Proposition. Let f : S ^ T and A C S . 

(a) /// is injective, then f^{A^) C (/^(A))^ 

(b) /// is surjective, then f^iA^) 5 (/"^(A))'. 

(c) /// is bijective, then f^{A^) = {f^{A)y. 

2.9.17. Proposition. Let f : S ^ T and ^ C ^3(5). Then 

(a) f^{m^f]{f^{A):AG^}; 

(b) equality need not hold in (a); but 

(c) if f is injective, then equality holds in (a); and furthermore 

(d) /-(U2t) = U{/^(^):^e2t}. 

2.9.18. Proposition. Let f : S ^ T and ^ ^ V{T). Then 

(a) /^(n^B) = n{/^(^) : B G 53}; and 

(b) f^{[j^) = [j{f^{B):B€^}. 

2.10. Inverse Functions 

Let /: 5 — )• T and g: T ^ S. If g o f = Ig, then is a left inverse of / and, equivalently, 
/ is a RIGHT inverse of g. We say that / is invertible if there exists a function from T into 
S which is both a left and a right inverse for /. Such a function is denoted by f~^ and is called 
the inverse of /. (Notice that the last "the" in the preceding sentence requires justification. See 



2.10. INVERSE FUNCTIONS 



13 



proposition 2.10.1 below.) A function is invertible if it has an inverse. According to the definition 
just given, the inverse of a function / must satisfy 

/o/-i = /r and r^of = Is. 

A simple, but important, consequence of this is that for an invertible function, y = f{x) if and 
only if X = f^^{y). (Proof: if y = f{x), then f^^{y) = f^^{f{x)) = Is{x) = x. Conversely, if 

X = rHy), then fix) = fif-Hy)) = hiy) = y-) 

2.10.1. Proposition. A function can have at most one inverse. In fact, if a function has both a left 
inverse and a right inverse, then these inverses are equal (and therefore the function is invertible). 

2.10.2. Exercise. The arcsine function is defined to be the inverse of what function? (Hint. The 
answer is not sine.) What about arccosine? arctangent? 

The next two propositions tell us that a necessary and sufficient condition for a function to have 
right inverse is that it be surjective and that a necessary and sufficient condition for a function to 
have a left inverse is that it be injective. Thus, in particular, a function is invertible if and only if 
it is bijective. In other words, the invertible members of J-'{S, T) are the bijections. 

2.10.3. Proposition. Let S ^ 9. A function f : S ^ T has a right inverse if and only if it is 
surjective. 

Proof. Suppose that / has a right inverse fr. For each y G T it is clear that y = Iriy) = 
f{fr{y)) £ ran/; so ran / = T and / is surjective. 

Conversely, suppose that / is surjective. Then for every y £ T the set f^{{y}) is nonempty. 
For each y G T let Xy be a member of /^({y}) and define 

fr'. T ^ S: y ^ Xy . 

Then f{fr{y)) = f{xy) = y, showing that fr is a right inverse of /. □ □ 

2.10.4. Remark. In the preceding proof we have made use of an important set theoretic axiom 
called the axiom of choice. This axiom says, roughly, that from an arbitrary family of nonempty 
sets it is possible simultaneously to extract one member from each set. Here is a more formal 
statement of the axiom: 

2.10.5. Axiom (Axiom of Choice I). 7/21 is a family of nonempty sets, then there exists a function 
/ : 2t ^ U 2t such that f{A) € A for every ^ G 21. 

Such a function is called a choice function. Notice that in the preceding proof fr is a choice 
function; it "chooses" for each y one member Xy of the set /'*~({y})- Not every mathematician 
approves of the use of the axiom of choice; some complain that it should not be invoked because it 
is not "constructive". However, most mathematicians accept it, and in these notes it will be used 
freely. 

Another assertion that is equivalent to the axiom of choice says that a nonempty product of 
nonempty sets is nonempty. 

2.10.6. Axiom (Axiom of choice II). If (Ax)x,z\ is a nonempty indexed family of pairwise disjoint 
nonempty sets, then the Cartesian product Y\^^^f^ Ax is nonempty. (For a definition of the Cartesian 
product see 8.3.6.) 

Here is yet another version of the axiom of choice. 

2.10.7. Axiom (Zermelo's postulate). 7/21 is a disjoint family of nonempty sets, then there exists 
a set C such that C r\ A has exactly one member for each ^4 G 21. 

In chapter 5 we will introduce another axiom, Zorn's lemma 5.3.1, which is equivalent to the 
axiom of choice. For proofs that all these versions of the axiom of choice are indeed equivalent, 
consult Hewitt and Stromberg [.^3], section 3; Brown and Page [""], section 1.5; Kelley [27], pages 
31-36; or any standard text on set theory. 
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2.10.8. Proposition. Let S ^ A function /: S" — J- T has a left inverse if and only if it is 
infective. 

2.10.9. Proposition. If a function f is bijective, then {f ^^)^ = f^ ■ 

ax ~\- b 

2.10.10. Exercise. Let f(x) = where a, b, c, d £ M. and not both c and d are zero. 

cx + d 

(a) Under what conditions on the constants a, b, c, and d is f injective? 

(b) Under what conditions on the constants a, b, c, and d is f its own inverse? 

2.11. Equivalence Relations and Quotients 

2.11.1. Definition. A relation i? on a set S is an equivalence relation if it is reflexive, 
symmetric, and transitive. If s is an element of S, then the equivalence CLASS containing s is 
the set of all the elements in S which are equivalent to s. We usually denote the equivalence class 
containing s by [s] . Any member t of the equivalence class [s] is called a representative of that 
equivalence class. The family of all equivalence classes of members of S is called the quotient of 
5 by i? (or 5 modulo R) and is denoted by S/R. The map vr : 5 S/R : s [s] is the quotient 
MAP. 

2.11.2. Exercise. In each item of the following list is a set and i? is a relation on S. Determine 
if each relation is reflexive, if it is symmetric, if it is transitive, and if it is an equivalence relation. 

(a) S = M; xRy if and only if x 7^ y. 

(b) 5 = N X N; (m, n)R{p, q) if and only if mq = np. 

(c) S is the set of all humans; xRy if and only if x is a sister of y. 

(d) S = Z; mRn if and only if — n? is divisible by 5. 

(e) = M; xRy if and only if + = 1. 

(f) S = M^; (n, v)R{x, y) if and only if u'^ + v"^ = x"^ + y"^ . 

2.11.3. Example. Define a relation ^ on the set S = {1, 2, . . . , 12} by 

X ~ y if and only if x"^ — is divisible by 6. 

The relation ~ is an equivalence relation on S. How many equivalence classes are there? What 
elements are in each one? 

2.11.4. Example. Define a relation ~ on the set D = {(x,y) G : (x,y) / (0,0)} by 

(a, 6) ~ (c, d) if and only if (?b = a^d. 

Show that ~ is an equivalence relation. Identify (that is, describe geometrically) the set D/ ~ of 
equivalence classes of D with respect to ~. 

2.11.5. Definition. A family 21 of nonempty subsets of a set 5 is a partition of the set S if the 
family 21 is pairwise disjoint and its union is all of S. 

2.11.6. Example. Let {Ai : i G /} be a partition of a set A and {Bj : j G J} be a partition of a 
set B. Then {Ai x Bj : {i,j) G / x J} is a partition of A x B. 

2.11.7. Proposition. 7/~ is an equivalence relation on a nonempty set S then the family S'/~ of 
equivalence classes generated by is a partition of S and the quotient map tt : S" — )• 5'/~ : s 1— )■ [s] 
is surjective. 

The word "partition" is used as a verb as well as a noun: in light of the previous proposition, 
one may say that the equivalence relation ~ partitions the set S. 

2.11.8. Exercise. Prove the converse of proposition 2.1L7: Every partition 5^ of S" is induced by 
some equivalence relation. (What exactly does this last assertion mean?) 
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2.11.9. Theorem (Fundamental quotient theorem for sets). Let ~ he an equivalence relation on 
a set S and vr: 5 — ?■ 5/~ 6e the quotient map. If f : S ^ T is a function from S into another set 
T which is constant on the equivalence classes of S (that is, if x ^ y then f{x) = f{y)), then there 
exists a unique function f : S/^^T which makes the following diagram commute. 

S 

vr 

S/ 

f 

Furthermore, f is injective if and only if f takes different values on different equivalence classes 
(that is, if X y, then fix) 7^ /(y) )', and f is surjective if and only if f is. 

When ~ is an equivalence relation on a set S, we often have occasion to work with the equiv- 
alence classes generated by ~. This is made slightly tricky by the fact that if elements x and y of 
5 are equivalent, then the equivalence class containing them may denoted equally well by [x] or 
[y]. That is, if x ~ y, then [x] = [y]. When we define some operation, say, on [x] we must be sure 
that the definition we give does not depend on the representative we choose from the equivalence 
class, that is, that it does not depend on the name we choose for the equivalence class. In other 
words we must show that the operation is well-defined. The next exercise shows how things may 
go horribly wrong if we are not careful about such matters. 

2.11.10. Exercise. The standard construction of the real numbers proceeds by first producing the 
positive integers and then using these to construct the set Z of all integers and to define addition 
and multiplication on them. The next step is to get the rational numbers. To do this let R be the 
set of all ordered pairs (m, n) of integers such that n 7^ 0. Define a relation ~ on i? as follows: 

(m, n) ~ (p, q) if mq = pn. 

(a) Show that ~ is an equivalence relation on R. 

Now according to proposition 2.11.7 the relation ~ partitions the set R into equivalence classes. 

m 

We denote the equivalence class containing (m,n) by — . The set of these equivalence classes is 

Ti- 
the set Q of rational numbers. 

(b) Now suppose we wish to define an operation © on Q as follows: 

m V m + p 

— © - := 

n q n + q 

Explain clearly why this definition makes no sense whatever; that is, explain why © is not 
well-defined. 

(c) Prove that the corresponding (usual) definition for multiplication is well-defined. 

m p mp 
n q nq 




CHAPTER 3 



CARDINALITY 

3.1. Finite and Infinite Sets 

There are a number of ways of comparing the "sizes" of sets. In this chapter we examine 
perhaps the simplest of these, cardinaHty. Roughly speaking, wc say that two sets have the "same 
number of elements" if there is a one-to-one correspondence between the elements of the sets. In 
this sense the open intervals (0, 1) and (0, 2) have the same number of elements. (The map x ^ 2x 
is a bijection.) Clearly this is only one sense of the idea of "size". It is certainly also reasonable to 
regard (0, 2) as being bigger than (0, 1) because it is twice as long. 

3.1.1. Definition. Two sets S and T are CARDINALLY EQUIVALENT if there exists a bijection from 

S onto T, in which case we write S ^ T. 

3.1.2. Proposition. On any family of sets cardinal equivalence is an equivalence relation. That 
is, if S, T , and U are sets, then 

(a) S ~ S; 

(b) ifS--T, then T ~ S; and 

(c) if S andT ^U, then 

3.1.3. Definition. A set S is finite if it is empty or if there exists n G N such that S is cardinally 
equivalent to some (unique) initial segment N„ = {!,..., n} of the natural numbers. A set is 
INFINITE if it is not finite. 

If 5 is a finite set, then it is in one-to-one correspondence with some N„. We say that n is the 

CARDINAL NUMBER of the set S and write n = card S. Wc take the cardinal number of the empty 
set, card0, to be 0. Thus for a finite set S the assertion that "the cardinality of S is n" is just 
another way of saying that "the set S has n elements" . 

3.1.4. Proposition. Let S* C T. IfTis finite, then S is finite and cardS" < cardT. 

Hint for proof . The case T = is trivial. Suppose T Let l: 5 — >■ T be the inclusion map 
of S into T. There exist n G N and a bijection / : T— > {l,...,n}. 

The preceding proposition "subsets of finite sets are finite" has a useful contrapositive: Sets 
which contain infinite sets are themselves infinite. 

3.1.5. Proposition. If S andT are disjoint finite sets, then 5 tfc) T is finite and 

card(S' l±l T) = card S + card T . 

3.1.6. Corollary. If S is a set and T is a finite set, then 

card(T \ S) = card T - card(r n S). 
In the preceding proposition notice that ii S C.T (where T is finite), then 

card(T \S) = card T - card S . 

3.1.7. Corollary. If S and T are finite sets, then SUT is finite and 

card(5' U T) = card S + card T - card{S n T) . 
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How do we show that a set S is infinite? If our only tool were the definition, we would face 
the prospect of proving that there does not exist a bijection from S onto an initial segment of 
the natural numbers. It would be pleasant to have a more direct approach than establishing the 
nonexistence of maps. This is the point of the next proposition. 

3.1.8. Proposition. A set is infinite if and only if it is cardinally equivalent to a proper subset of 
itself. 

Proof. Suppose that S is infinite. We prove that there exists a proper subset T o( S and a 
bijection / from S onto T. We choose a sequence of distinct elements in S, one for each A; G N. 
Let ai be an arbitrary member of S. Then S \ {oi} ^ 0. (Otherwise S ~ {«i} and S is finite.) 
Choose 02 G S" \ {ai}. Then S \ {ai, 02} 7^ 0. (Otherwise S ~ {ai, 02} and S is finite.) In general, 
if distinct elements oi, . . . , a„ have been chosen, then S \ {ai, . . . , a„} cannot be empty; so we may 
choose ttn+i £ S \ {oi, . . . , On}. Let T = S \ {ai}, and define /: — )• T by 

\ I Ofc+i, if X = Cfc for some k 
f[x) = < 

I X, otherwise. 

Then / is a bijection from S onto the proper subset T oi S. 

For the converse construct a proof by contradiction. Suppose that 5" ~ T for some proper 
subset T C 5, and assume further that S is finite, so that S ~ {1, . . . ,n} for some n G N. Then by 
proposition 3.1.4 the set 5 \ T is finite and, since it is nonempty, is therefore cardinally equivalent 
to {1, . ■ ■ ,p} for some p G N. Thus 

n = card S 
= card T 

= cavdiS \{S\T)) 

= card S — card(S' \ T) (by corollary 3.1.6) 
= n — p. 

Therefore p = 0, which contradicts the earlier assertion that p G N. □ 

3.1.9. Example. The set N of natural numbers is infinite. 

3.1.10. Example. The interval (0,1) is infinite. 

3.1.11. Example. The set M of real numbers is infinite. 

The next two results tell us that functions take finite sets to finite sets and that for injective 
functions finite sets come from finite sets. 

3.1.12. Proposition. If T is a set, S is a finite set, and f : S" — J- T is surjective, then T is finite. 

3.1.13. Proposition. If S is a set, T is a finite set, and f : S ^ T is injective, then S is finite. 

3.1.14. Exercise. If S" is a finite set with cardinal number n, what is the cardinal number of *P(S')? 

3.2. Countable and Uncountable Sets 

There are many sizes of infinite sets — infinitely many in fact. In our subsequent work we need 
only distinguish between countably infinite and uncountable sets. A set is countably infinite if it is 
in one-to-one correspondence with the set of positive integers; if it is neither finite nor countably 
infinite, it is uncountable. In this section we present some basic facts about and examples of both 
countable and uncountable sets. This is all we will need. Except for exercise 3.2.25, which is pre- 
sented for general interest, we ignore the many intriguing questions which arise concerning various 
sizes of uncountable sets. For a very readable introduction to such matters see [26], chapter 2. 
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3.2.1. Definition. A set is COUNTABLY infinite (or denumerable) if it is cardinally equivalent to 
the set N of natural numbers. A bijection from N onto a countably infinite set S is an enumeration 
of the elements of S. A set is countable if it is either finite or countably infinite. If a set is not 
countable it is uncountable. 

3.2.2. Example. The set E of even integers in N is countable. 

The first proposition of this section establishes the fact that the "smallest" infinite sets are the 
countable ones. 

3.2.3. Proposition. Every infinite set contains a countably infinite subset. 

If we are given a set S which we believe to be countable, it may be extremely difficult to 
prove this by exhibiting an explicit bijection between N and S. Thus it is of great value to know 
that certain constructions performed with countable sets result in countable sets. The next five 
propositions provide us with ways of generating new countable sets from old ones. In particular, 
we show that each of the following is countable. 

(1) Any subset of a countable set. 

(2) The range of a surjection with countable domain. 

(3) The domain of an injection with countable codomain. 

(4) The product of any finite collection of countable sets. 

(5) The union of a countable family of countable sets. 

3.2.4. Proposition. If S QT where T is countable, then S is countable. 

The preceding has an obvious corollary: If S '^T and S is uncountable, then so is T. 

3.2.5. Proposition. If f : 5" — )• T is infective and T is countable, then S is countable. 

3.2.6. Proposition. If f : S ^ T is surjective and S is countable, then T is countable. 

3.2.7. Example. The set N x N is countable. 

3.2.8. Example. The set {a; G Q : x > 0} is countable. 

3.2.9. Proposition. If S and T are countable sets, then so is S x T. 

3.2.10. Corollary. If Si, . . . , Sn are countable sets, then Si x ■ ■ ■ x Sn is countable. 
Finally we show that a countable union of countable sets is countable. 

3.2.11. Proposition. Suppose that A is a countable family of sets and that each member of A is 
itself countable. Then \JA is countable. 

3.2.12. Example. The set Q of rational numbers is countable. 

By virtue of 3.2.4-3.2.11 we have a plentiful supply of countable sets. We now look at an 
important example of a set which is not countable. 

3.2.13. Example. The set M of real numbers is uncountable. 

Proof. We take it to be known that if we exclude decimal expansions which end in an infi- 
nite string of 9's, then every real number has a unique decimal expansion. By (the corollary to) 
proposition 3.2.4 it will suffice to show that the open unit interval (0, 1) is uncountable. Argue by 
contradiction: assume that (0, 1) is countably infinite. (We know, of course, from example 3.1.10 
that it is not finite.) Let ri, r2, ra, . . . be an enumeration of (0, 1). For each j G N the number rj 
has a unique decimal expansion 

O.rji rj2 rjs .... 

Construct another number x = O.xi X2X3 . . . as follows. For each k choose Xjt = 1 if rkk 7^ 1 and 
Xk = 2 ii rkk = 1- Then x is a real number between and 1, and it cannot be any of the numbers 
rk in our enumeration (since it differs from at the k^^ decimal place). But this contradicts the 
assertion that ri, r2, ^3, . . . is an enumeration of (0, 1). □ 
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3.2.14. Example. The set of irrational numbers is uncountable. 

3.2.15. Proposition. If S is a countable set and T is uncountable, then T\S ^ T. 

3.2.16. Exercise. A finite word is an n-tuple of upper case letters of the English language 
written without parentheses or commas. (Examples: DOG, BFTLZXP, ARRRRGHHHH.) An 
infinite word is an (infinite) sequence of upper case English letters. 

(a) Prove or disprove: the set of finite words is countable. 

(b) Prove or disprove: the set of infinite words is countable. 

3.2.17. Notation. Let A and B be sets. Denote by the family of all functions from B into A. 

3.2.18. Proposition. If S is a set, then {0, 1}'^ is cardinally equivalent to ^{S), the power set 
ofS. 

3.2.19. Exercise. Your good friend Fred R. Dim argues that there are only countably many 
real numbers. After all, he says, there are only ten choices for each position in a number's decimal 
expansion; there are only countably many positions; and a countable union of finite sets is certainly 
countable. Explain carefully to Fred what is wrong with his argument. 

3.2.20. Exercise. Let e be an arbitrary number greater than zero. Show that the rationals in 

[0, 1] can be covered by a countable family of open intervals the sum of whose lengths is no greater 
than e. (Recall that a family it of sets is said to COVER a set A if A C.\Jii.) Is it possible to cover 
the set Q of all rationals in M by such a family? 

3.2.21. Example. (Definition: The OPEN DISK in with radius r > and center {p, q) is defined 
to be the set of all points {x,y) in such that {x — p)^ + {y — qf' < r^-) The family of all open 
disks in the plane whose centers have rational coordinates and whose radii are rational is countable. 

3.2.22. Example. (Definition: A real number is algebraic if it is a root of some polynomial of 
degree greater than with integer coefficients. A real number which is not algebraic is transcen- 
dental. It can be shown that the numbers tt and e, for example, are transcendental.) The set of 
all transcendental numbers in M is uncountable. Hint. Start by showing that the set of polynomials 
with integer coefficients is countable. 

3.2.23. Example. The set of all sequences each of whose terms is either or 1 is uncountable. 

3.2.24. Example. Let 5 be a disjoint family of intervals in M each with length strictly greater 
than 0. Then ^ is countable. 

3.2.25. Example. The family J^(R,R) of all real valued functions of a real variable is an uncount- 
able set that is not cardinally equivalent to M. Hint. Let J-" = ^'(M,]^). Assume there exists a 
bijection ^: M — >■ J". What about the function / defined by 

f{x) = l + {cl>{x)){x) 

for all X G M? 
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4.1. Operations 



4.1.1. Definition. Let S" be a set. A function from S into itself is sometimes called a unary 
OPERATION. A function from S x S into 5 is a binary operation. Usually when the word 
"operation" is used without a modifier "binary operation" is meant. 

4.1.2. Example. If 5 is a nonempty set, then the map A i— >• A'^, which takes a subset ^4 of 5 to 
its complement (in 5), is a unary operation on ^{S), the power set of S. 

4.1.3. Example. If S" is a nonempty set, the map which takes a pair of subsets A and B to their 
union AU B is a binary operation on ^{S). The notational convention here, putting the name 
of the function between the two operands, is standard. It is almost always the case that if * is a 
binary operation we write a * b or just ab for the value of * at (a, b) — rather than something like 
*{a,b). 

4.1.4. Definition. Let 5 be a set, * a binary operation on S, and ACS. The set A is CLOSED 
under * if a * 6 belongs to A whenever a and b are members of A. Similarly, we say that the set 
A is closed under a unary operation s i— )• s' on S if a' belongs to A whenever a does. In chapter 10 
we will encounter the notion of a set in a topological space being closed. One should be careful to 
distinguish this (topological) use of the word "closed" from the (algebraic) concept of a set being 
closed under some binary operation. 

4.1.5. Example. The set of real numbers r > is closed under the usual operations of addition, 
multiplication, and division, but not under subtraction. 

4.1.6. Definition. Let S" be a set and * a binary operation on S. The operation * is 

(a) ASSOCIATIVE 'd {s * t) * u = ,s * {t * u) for all s, t, u £ S; and 

(b) commutative if s * t = t * s for all s, t G S . 

If * and o are both binary operations on S and 



hold for ah s, t, u e S, we say that * distributes over o. Equation (4.1) is a left distributive 
LAW, and (4.2) is a right distributive law. 

4.1.7. Example. If 5" is a nonempty set the operations of union U and intersection n are both 
associative and commutative. Furthermore, union distributes over intersection and intersection 
distributes over union. In the real number system multiplication distributes over addition but 
addition does not distribute over multiplication. 

4.1.8. Definition. If 5" is a set and * is an associative binary operation on S, then the pair (S, *) 
is a SEMIGROUP. It is illogical, but standard, to say "let S be a semigroup" instead of "let {S, *) be 
a semigroup" and to say that the semigroup S is commutative when we mean that its operation * 
is. 



s * {tou) = (s *t) o (s * u) 
{tou)*s = (t * s) o (u* s) 



(4.1) 
(4.2) 
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4.1.9. Definition. Let S he a set and * a binary operation on S. An element Ig of S is an 

IDENTITY ELEMENT witll respect to * if 

for all s ^ S. 

4.1.10. Definition. Let S" be a set which has an identity element Is with respect to a binary 
operation * on S" and let s be an element of S. An element s' of S which satisfies s' * s = Is 
is a LEFT INVERSE of s (with respect to *). If it satisfies s * s' = Is it is a right inverse of s 
(with respect to *). It is an inverse of s if it is both a left and right inverse of s. An element is 
INVERTIBLE if it has an inverse. 

4.1.11. Definition. A semigroup (S,*) with an identity is called a monoid. 

4.1.12. Example. If S" is a nonempty set, the family J-'{S, S) of all functions from S into itself is 
a monoid under composition. 

4.1.13. Example. The real numbers under addition form a monoid; so do the real numbers under 
multiplication. 

4.1.14. Example. Monoids are used extensively in computer science. Search the Wikipedia [ ] 
for examples such as syntactic, transformation, trace, and history monoids. 

4.1.15. Proposition. The identity element of a monoid is unique. 

4.1.16. Proposition. An element of a monoid can have at most one inverse. 

4.1.17. Proposition. If a is an invertible element of a monoid, then is also invertible and 

{a-y = a. 

4.1.18. Proposition. If a and b are invertible elements of a monoid, then their product ab is also 
invertible and 

(ab)-^ = b-^a-\ 
We can improve on proposition 4.1.16 a bit. 

4.1.19. Proposition. // an element c of a monoid has both a left inverse and a right inverse 
c^ , then it is invertible and = c' = c^ . 

Hint for proof . Look at your proof of proposition 2.10.1. 

4.1.20. Corollary. Let a and b be elements of a monoid. If both ab and ba are invertible, then so 
are a and b. 

4.2. Groups 

4.2.1. Definition. A monoid (G, *) is a GROUP if every element in G has an inverse with respect 
to *. It is customary to say, "let G be an group" rather than, let (G,*) be a group". Usually the 
identity element of an arbitrary group G is denoted by Iq or simply by 1 if no confusion is likely 
to arise. The standard notation for the inverse of an element a in a group is a^^. 

A group whose operation * is commutative is usually called an Abelian group. In most cases 
the preferred notation for the binary operation of an Abelian group G is +. Its identity is then 
denoted by Og (or just by 0) and the inverse of an element a by —a. The notation a — 6 is shorthand 
for a + (— 6). In brief, arbitrary groups are written multiplicatively and (usually) Abelian groups 
additively. 

4.2.2. Example. In the real numbers Z, Q and M itself are Abelian groups under ordinary addition. 
The set N of natural numbers (under addition) is a commutative semigroup but not a group. 



4.3. HOMOMORPHISMS OF SEMIGROUPS AND GROUPS 
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4.2.3. Example. Let G be the set of nonzero real numbers and • be ordinary multiplication. Then 
(G, • ) is an Abelian group. The identity element is the number 1 and the inverse of x G G is its 
reciprocal. This is one of the few situations in which the operation of an Abelian group is (for 
obvious reasons) written multiplicatively (using •) rather than additively (using +). 

4.2.4. Example. Let Z2 = {0, 1} and define addition on Z2 to be addition modulo 2; that is, define 
+ = 1 + 1 = and + 1 = 1 + = 1. This makes Z2 into an Abelian group. 

4.2.5. Example. Recall that an m x n matrix of real numbers is a rectangular array of real 
numbers with m rows and n columns. If a is such a matrix, then aij is the element in the i^^ row 
and the j**^ column. It is common to use the notation [oij] for the matrix a itself. If you find the 
notion of "rectangular array" uncongenial, define an mxn matrix of real numbers to be a function 
from Nm X N„ into M. We denote by M„ the set of all n x n matrices of real numbers. Define 
addition of two n x n matrices coordinatewise; that is, if a, 6 G M„, define a + 6 to be the mxn 
matrix whose entry in the i^^ row and the j*^ column is Uij + bij. It is easy to see that this makes 
Mn into an Abelian group. 

4.2.6. Example. If S" is a set and G is an Abelian group, then J^{S,G) (see 2.3.2) is an Abelian 
group under POINTWISE ADDITION, which is defined for all f,g£ J^{S, G) by 

{f + 9){s) = f{s)+9{s) 

for all s €z S. Notice that in the displayed equation the + on the left side is the addition that 
is being defined in J-{S, G) while the one on the right is addition in G. It would be correct, but 
perhaps overly pedantic, to write 

(/ +.F{s,G) 9)is) = f{s) +a 9{s). 

4.2.7. Example. If 5" is a nonempty set, then (*P(S'), A), the power set of S under the operation 
of symmetric difference (see section 1.6), is an Abelian group. 

4.2.8. Example. A real valued function / defined on a subset of the real line belongs to J-q if its 
domain contains some open interval which contains 0. For f,g^J^o define f + g pointwise; that 
is, let 

U + g)ix) = fix)+gix) 
for all X common to the domains of / and g. 

(a) Explain why J^q is not an Abelian group under this operation. 

(b) Suppose we say that two functions / and g in J^q are "equivalent" if f{x) = g{x) for all x 
in some open interval containing 0. Show that this relation really is an equivalence relation 
and explain carefully how the set of equivalence classes can be made into an Abelian group 
under "pointwise addition". (These equivalence classes are called GERMS of functions.) 

4.2.9. Proposition. Let a he an element of an arbitrary group. If a? = a, then a = 1. 

4.2.10. Definition. Let G be a group and i7 be a subset of G containing the identity element of 
G. Then H is a SUBGROUP of G if it is closed under the group operation of G and is a group under 
that operation. 

4.2.11. Proposition. The intersection of a family of subgroups of a group G is itself a subgroup 
ofG. 

4.3. Homomorphisms of Semigroups and Groups 
4.3.1. Definition. Let {G, □) and (H,* ) be semigroups. A map f : G ^ H is a semigroup 

HOMOMORPHISM if 

fix □ y) = fix) * fiy) 

for all X, y € G. We will denote by IIom(G, H) the set of all homomorphisms from G into H and 
wiU abbreviate Hom(G, G) to Hom(G). A GROUP homomorphism is just a homomorphism of the 
corresponding semigroups. 
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Here is another, diagrammatic, way of saying the same thing: the function / is a homomorphism 
if the diagram 

G xG ^-^^^H X H 

□ * 
G 

f 

commutes. 

4.3.2. Definition. Let (G, □) and {H,*) be semigroups. An ISOMORPHISM from G to is a 
homomorphism f : G ^ H such that exists and is also a homomorphism. If an isomorphism 
from G to H exists, then G and H are isomorphic. 

4.3.3. Example. Let G be the Abehan group of real numbers under addition and H be the 
Abelian group {x G M: x > 0} of strictly positive real numbers under multiplication. The map 
exp: G — )• iJ: X I— )■ is an isomorphism. (We take the most basic properties of the elementary 
functions as prerequisite material. For more on this see the disclaimer in section 6.3.) 

4.3.4. Proposition. Every bijective homomorphism of semigroups is an isomorphism. 

4.3.5. Exercise. Let 1 be the one element group and / : G — )• be a homomorphism between 
arbitrary groups G and H. Show that the following diagram of homomorphisms commutes. 

1 



G 




H 



What does this result say in ordinary English? 

The preceding result need not hold in monoids. It is common practice to define a monoid 
homomorphism as a semigroup homomorphism which preserves the identity element. 

4.3.6. Example. Regard M as a monoid under ordinary multiplication and M'^ as a monoid under 
pointwise multiplication (where {a,b){c,d) = {ac,bd)). Then the inclusion map a i— )• (a, 0) of M 
into is a semigroup homomorphism which does not preserve the identity element. 

In addition to preserving identity elements group homomorphisms also automatically preserve 
inverses. 

4.3.7. Proposition. Let f : G ^ H be a group homomorphism. Then f[x~^^ = (/(x)) ^ for each 
xGG. 

4.3.8. Definition. Let G and H be Abelian groups. For / and g in Hom(G, define 

f + g:G-^H: f{x) + g{x). 

4.3.9. Proposition. Let G and H be Abelian groups. With addition as defined above IIom(G, H) 
is an Abelian group. 

4.3.10. Notation. Let G, H, and J be Abelian groups and f : G ^ H and g: H — J be 
homomorphisms. Usually the composite of g with / is denoted by gf (rather than by g o f). 
That is, 

gf:G-^J: x ^ g{f{x)). 

4.3.11. Proposition. Let G, H, and J be Abelian groups. If f £ Hom(G, H) and g G Hom(ff, J), 
then the composite of g with f belongs to Hom(G, J). 
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4.3.12. Definition. Let G and H be Abelian groups and f : G ^ H he a homomorphism. Then 
{x e G: f{x) = 0} is the KERNEL of / and is denoted by ker/. 

4.3.13. Proposition. Let f : G ^ H be a homomorphism of Abelian groups. Then ker / is a 
subgroup of G and ran / is a subgroup of H. 

4.4. Vector Spaces 

4.4.1. Definition. Let F be a field. A vector space over F is an Abehan group V together 
with a mapping m:¥xV^V: {a,v) i-^ av, called scalar multiplication, which satisfies the 
following axioms for all a, /3 G F and x, y £ V: 

(i) a{x + y) = ax + ay; 

(ii) (a + /3)x = ax + {3x; 

(iii) {af3)x = a{(3x); and 

(iv) Ix = X. 

The elements of V are called vectors and, in this context, the elements of F are called SCALARS. 

Definitions of this sort, especially if used thoughtlessly, can make mathematicians unpopular in 
some circles. Suppose an engineer comes up to a mathematician and asks in all good faith. You're 
an expert at this linear algebra stuff. What is a vector, really? It is unhelpful and perhaps a bit 
insensitive — but of course entirely accurate — to reply. Well, it 's just an element of a vector space. 

A vector space over the field M of real numbers (see section 6.1) is a real vector space and 
one over the field C of complex numbers (see section 6.6) is a complex vector space. In these 
notes the only vector spaces which occur are the ones with real or complex scalars. Since the real 
vector spaces occur most frequently we make the following agreement. 

4.4.2. Convention. In the sequel the unmodified term "vector space" will mean "real vector 
space". When we want a vector space V to have complex scalars, we will say so: we will say Let 
V be a complex vector space. We will ordinarily follow the usual convention and write "Let y be a 
vector space," when to be correct we should write "Let (V,+,m) be a vector space," or even more 
precisely, "Let (F, +,m, M)) be a real vector space." 

4.4.3. Example. The vector space V = {0} comprising a single element is the zero vector 
space. When we say that a vector space is nontrivial we mean that it is not the zero vector 
space. 

4.4.4. Example. The simplest nontrivial example of a (real) vector space is the set M of real 
numbers itself. Here both vectors and scalars are real numbers and scalar multiplication is ordinary 
multiplication of real numbers. 

4.4.5. Example. Euclidean n-space M" is a vector space under coordinatewise operations of ad- 
dition and scalar multiplication: 

(Xl, ...,Xn) + {yi,.. .,yn) ■= {Xi +2/1, . . .,Xn + yn) 

and 

(3?! , . . . , Xn) . — (q;Xi,...,0 Xn) 

for aeR, x = (xi,...,x„) e M", and y = (yi,...,y„) G M". 

4.4.6. Example. The Abelian group M„ of n x n matrices of real numbers (see example 4.2.5) 
becomes a vector space when we define scalar multiplication coordinatewise: 

aa := [a Uij] 

for a G M and a = [oij] G M„(M). 
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4.4.7. Example. If 5 is a set and F is a vector space, then the Abehan group T{S,V) (see 
example 4.2.6) becomes a vector space under pointwise scalar multiplication: 

(a/)(s) ■=af{s) 

for a G M and / G ^(5*, V). Recall that for the special case of real valued functions we write J-{S) 
instead of J^(S', M). 

4.4.8. Proposition. Let x be a vector in some vector space. Then (—1) x = —x. 

4.4.9. Proposition. Let x be a vector in some vector space and a be a scalar. Then ax = if 
and only if a = or x = 0. 

4.4.10. Notation. If x and y are vectors in some vector space, we define x — y tohe x + (— y)- If 
A and B are subsets of a vector space, we define 

A + B := {a + b : a e A,b e B}; 

and if a G M, 

aA := {aa : a G A}. 

We also take —A to be (— 1)^4, which, according to exercise 4.4.8, is the same as {—a: a G A}. And 
of course, A — B means A + {—B). 

4.4.11. Definition. A nonempty subset of a vector space y is a SUBSPACE of V if it is a vector 
space under the operations of addition and scalar multiplication it inherits from V. 

4.4.12. Proposition. A nonempty subset of a vector space V is a subspace of V if it is closed 
under the operations of addition and scalar multiplication ofV. 

4.4.13. Definition. A real valued function / on a set S is bounded if there exists a number 
M > such that < M for all s G 5". We denote the family of all bounded real valued 
functions on S by ^(5", M) or, more briefly, by B{S). 

AAAA. Example. If 5 is a nonempty set, then the family B{S) = ;B(S', M) of bounded real valued 
functions on 5 is a subspace of the vector space F{S) (see example 4.4.7). 

4.5. Linear Transformations 
4.5.1. Definition. A map T: V ^ W between vector spaces is linear if 

T{x + y) = Tx + Ty for ah x,yeV (4.3) 

and 

T{ax) = aTx for all x e V and a G M. (4.4) 

A linear map is frequently called a linear transformation, and, in case the domain and 
codomain are the same, it is usually called a linear operator. The family of all linear transfor- 
mations from V into W is denoted by £{V, W). We shorten £(y, V) to £^{V). 

Two oddities of notation concerning linear transformations deserve comment. First, the value 
of T at a; is usually written Tx rather than T{x). Naturally the parentheses are used whenever 
their omission would create ambiguity. For example, in (4.3) above Tx + y is not an acceptable 
substitute for T(x + y). Second, as with group homomorphisms, the symbol for composition of two 
linear transformations is ordinarily omitted. If S G 2{U, V) and T G W), then the composite 
of T and S is denoted by TS (rather than by T o 5) . As a consequence of this convention when 
T G 2.{V) the linear operator T o T is written as T^, T o T o T as T^, and so on. 

The first condition (4.3) defining linearity says that a linear map must be a homomorphism 
of Abelian groups. We have already seen (at the beginning of section 4.3) how to view this as a 
commutative diagram. Condition (4.4) of the definition can similarly be thought of in terms of a 
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diagram. For each scalar a define the function M^, multiplication by a, from a vector space 
into itself by 

Maix) = ax . 

(We use the same symbol for multiplication by a in both of the spaces V and W.) Then condition 
(4.4) holds if and only if for every scalar a the following diagram commutes. 

V W 

V > w 

T 

4.5.2. Example. If T: : x H> (xi + 2:3, xi - 2x2), then T is linear. 

4.5.3. Proposition. Let T: V ^ W be a linear transformation between vector spaces. Then 

(a) T(0) = 0. 

(b) T{x — y) = Tx — Ty for all x, y £ V. 

4.5.4. Example. Each coordinate projection defined on M" 

TTfc : M*^ M : X H- Xfc 

is linear. 

4.5.5. Example. Let = J-'((a,6),M) be the family of all real valued functions defined on the 
open interval (a, b) and let V = P((a, fe), M) be the set of all members of J- which are differentiable 
at each point of (a, b). Then 2? is a vector subspace of J- and that the differentiation operator 

D:V^J^:f^f' 

(where /' is the derivative of /) is linear. 

An important observation is that the composite of two linear transformations is linear. 

4.5.6. Proposition. Let U, V, and W be vector spaces. If S e £(C/, and T G 2.{V,W), then 
TS G 2.{U,W). 

4.5.7. Definition. If T : V \sa linear transformation between vector spaces, then the KERNEL 
(or NULL space) of T, denoted by ker T, is its kernel as a group homomorphism (see 4.3.12). That 
is, 

kerT = T^{0} = {x ^V:Tx = 0}. 

4.5.8. Exercise. In we call the vectors = i = (1,0,0), = j = (0, 1,0), and = k = (0,0,1) 
the STANDARD BASIS VECTORS for M^. Suppose that T G £(M^,M^) satisfies 

rei = (1,-2,3) 

Te^ = (0,0,0) 

Te^ = (-2,4,-6) . 

Find and describe geometrically the kernel of T and the range of T. 

It is useful to know that the kernel of a linear transformation is always a vector subspace of 
its domain, that its image is a vector subspace of its codomain, and that a necessary and sufficient 
condition for a linear transformation to be injective is that its kernel contain only the zero vector. 

4.5.9. Proposition. IfT: V ^ W is a linear transformation between vector spaces, then kerT is 
a subspace ofV and ranT is a subspace ofW. 

4.5.10. Proposition. A linear transformation T is injective if and only if ker T = {0}. 

4.5.11. Proposition. Every bijective linear map between vector spaces is an isomorphism. 
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Hint for proof . Part of this has already been done in exercise 4.3.4. 

4.5.12. Definition. Let y be a vector space. A hnear map from a (real) vector space into its field 
M of scalars is a linear functional. We will denote the family of all linear functionals on V by 

rather than by M). This space is the algebraic dual space of V. 

4.5.13. Example. Let 5 be a nonempty set and V = J-{S) be the vector space of all real valued 
functions on S. For every x & S the function 

E,:V^R:f^f{x) 

is a linear functional on V. The functional belongs to and is called the evaluation 
functional at x. 

4.5.14. Example. We know from beginning calculus that the family TZ = 7^([a, 6], M) of Riemann 
integrable functions is closed under the pointwise operations of addition and multiplication by real 
scalars. Also recall that 

pb nb 

/ af{x) dx = a f{x) dx 

J a J a 

where a e M and f £ TZ, and that 

pb rb rb 

I {fix) + g{x))dx = j f{x)dx+ g{x)dx 

J a J a J a 

where f,g £ TZ. It is then clear that 7^ is a vector subspace of and that the function / i— )■ 
la ■^(^) ^ linear functional on TZ. 



4.6. Rings and Algebras 

4.6.1. Definition. Let {R, +) be an Abelian group and • a binary operation on R (which we 
call multiplication). Then (i?, +, •) is a RING if the multiplication operation is associative and 
distributes over addition. If, in addition, there exists an identity element 1r in R with respect to 
multiplication (sometimes called a unit), then we say that (ii, +,•) is a unital ring. A ring is 
said to be commutative if the multiplication operation is commutative. 

Since a unital ring has an identity element with respect to two different operations we distinguish 
them by referring to the additive identity and to the multiplicative identity. We also refer to additive 
inverses and (when they exist) multiplicative inverses. Most often we write ab for a ■ b. 

We customarily say, "Let i? be a ring," rather than, "Let {R, +, m) be a ring." As a matter of 
notation it is usually harmless to denote the multiplicative identity of a unital ring by 1. If a and 
b are elements of a ring, then ab is called the product of a and b and is occasionally written as a • 6. 
We write for a ■ a, and for a ■ a ■ a, and so on. 

4.6.2. CAUTION. Many authors define rings to be both unital and commutative. 

4.6.3. Example. Under the additional operation of ordinary multiplication the Abelian groups Z, 
Q, and M (see example 4.2.2) become commutative unital rings. 

4.6.4. Example. Let Z2 be the Abelian group given in example 4.2.4. If we define • = • 1 = 
1-0 = and 1-1 = 1, then Z2 becomes a commutative unital ring. 

4.6.5. Example. Let S" be a nonempty set. According to example 4.2.7 (^(5), A) is an Abelian 
group. With intersection as "multiplication" (*p(S'), A,n) is a commutative unital ring. 

4.6.6. Example. Let G be an Abelian group. Then Hom((jr) is a unital ring under pointwise 
addition and composition. (See exercise 4.3.9.) 
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4.6.7. Definition. Let {A, +, • ) be a ring which is additionally equipped with a scalar multiplica- 
tion m such that (^4, +, m) is a vector space. If 

a{xy) = {ax)y = x{ay) (4-5) 

for each x, y G A and a G M, then [A, +, • , m) is a (real) algebra (sometimes referred to as a 
LINEAR ASSOCIATIVE ALGEBRA). Of course, if ^ is a complex vector space, condition (4.5) should 
hold for all complex numbers a. We abuse notation in the standard way by writing such things as, 
"Let A be an algebra." We say that an algebra A is unital if its underlying ring [A, +, • ) is. And 
it is COMMUTATIVE if its ring is. 

4.6.8. Convention. Algebras, like vector spaces, may have either real or complex scalars. In these 
notes algebras will be assumed to have real scalars unless we specify otherwise. Later on when we 
want the scalars of an algebra A to come from the field C of complex numbers, we will say that A 
is a complex algebra. 

4.6.9. Example. If 5* is a nonempty set, then the vector space J-^{S) (see example 4.4.7) is a 
commutative unital algebra under pointwise multiplication, which is defined for all f,gG J^{S) 
by 

if-g){s) = f{s)-gis) 

for all s G S. The constant function 1 (that is, the function whose value at each s G S is 1) is the 
multiplicative identity. 

4.6.10. Example. We have seen in example 4.4.6 that the set M„ of n x n matrices of real numbers 
is a vector space. If a = [ajj] and b = [b^i] are n x n matrices, then the product of a and b is 
the n X n matrix c = ab whose entry in the i^^ row and k^^ column is Cik = Yl^=i ^^ij^jk ■ This 
definition makes M„ into a unital algebra. 

4.6.11. Example. If V is a vector space, then the set 2.{V) of linear operators on 1/ is a unital 
algebra under pointwise addition, pointwise scalar multiplication, and composition. 

4.6.12. Definition. An element a of a ring R is an annihilator of i? if ax = j;o = for every 
X e R. 

4.6.13. Proposition. The additive identity of a ring is an annihilator. 

4.6.14. Proposition. If a andb are elements of a ring, then {—a)b = a{—b) = —ab and (—a) (—6) = 
ab. 

4.6.15. Proposition. Let a and b be elements of a unital ring. Then 1 — ab is invertible if and 
only if 1 — ba is. 

Hint for proof . Look at the product of 1 — 6a and 1 + bca where c is the inverse of 1 — ab. 

4.6.16. Definition. A nonzero element a of a ring is a zero divisor (or divisor of zero) if 
there exists a nonzero element b of the ring such that a6 = or 6a = 0. 

4.6.17. Definition. An element a of a ring (or, more generally, a semigroup) is left cancellable 
if a6 = ac implies that b = c. It is right cancellable if 6a = ca implies that 6 = c. A ring (or a 
semigroup) has the cancellation property if every nonzero element of the ring is both left and 
right cancellable. 

4.6.18. Proposition. A ring has the cancellation property if and only if it has no zero divisors. 

4.6.19. Definition. A field is a commutative unital ring R in which the additive and multiplica- 
tive identities are distinct {Ir 7^ Or) and every nonzero element x has a multiplicative inverse x~^. 

4.6.20. Example. The rings M and Q are fields; Z is not. 

4.6.21. Example. The ring Z2 (see example 4.6.4) is a field. 
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4.6.22. Definition. An element a of a ring (or, more generally, a semigroup) is idempotent if 
a? = a. A ring i? is a Boolean ring if every element of the ring is idempotent. 

4.6.23. Example. The ring Z2 (see example 4.6.4) is a unital Boolean ring. 

4.6.24. Example. Let 5 be a nonempty set. The family 7^(5, Z2) of all functions from S into the 
field Z2 (see example 4.6.21) under pointwise addition and multiplication is a unital Boolean ring. 

4.6.25. Example. Let S" be a nonempty set. The ring (*p(S'),A,n) (see example 4.6.5) is a 
Boolean ring. 

4.6.26. Proposition. For every element a of a Boolean ring a + a = 0. 

4.6.27. Proposition. Every Boolean ring is commutative. 

4.6.28. Definition. A subset of an algebra A which is closed under the operations of addition, 
multiplication, and scalar multiplication is a subalgebra of A. If ^ is a unital algebra and B \s a, 
subalgebra of A which contains the multiplicative identity of A, then i? is a unital subalgebra 
of ^. 

4.6.29. CAUTION. Be very careful with the preceding definition. It is possible for to be a 
subalgebra of an algebra A and to be a unital algebra but still not be a unital subalgebra of A\ 
The definition requires that for i? to be a unital subalgebra of A the identity of B must be the 
same as the identity of A. Example: Under pointwise operations ^ = is a unital algebra. The 
set B = {(x,0): x E M} is a subalgebra of A. And certainly B is unital (the element (1,0) is the 
multiplicative identity of B). But B is not a unital subalgebra of A because it does not contain 
the multiplicative identity (1, 1) of j4. 

4.6.30. Example. Let 5 be a nonempty set. The family B{S) of all bounded real valued functions 
on 5 is a unital subalgebra of the algebra J-{S) of all real valued functions on S. 

4.6.31. Definition. A left ideal in an algebra yl is a vector subspace J of ^ such that AJ C J. 
(For RIGHT IDEALS, of course, we require J A C J.) We say that J is an ideal if it is both a left 
and a right ideal. A proper ideal is an ideal which is a proper subset of A. A maximal ideal 
is a proper ideal that is properly contained in no other proper ideal. We denote the family of all 
maximal ideals in an algebra A by Maxj4. 

4.6.32. Proposition. No proper ideal in a unital algebra can contain an invertible element. 

4.6.33. Proposition. Let a be an element of a commutative unital algebra A. If a is not invertible, 
then aA is a proper ideal in A. (It is called the principal ideal generated by a.) 

4.7. Ring and Algebra Homomorphisms 

4.7.1. Definition. A function f : R ^ S between rings is a (ring) homomorphism if it is a 
homomorphism of Abelian groups and preserves multiplication as well as addition; that is, if 

f{x + y) = f{x) + f{y) 

and 

f{xy) = f{x)f{y) (4.6) 
for all x and y in R. If additionally R and S are unital rings and 

/(li?) = Is (4.7) 

then / is a unital (ring) homomorphism. Obviously a ring homomorphism /: ii — t- S" is a group 
homomorphism of R and S regarded as Abelian groups. The kernel of / as a ring homomorphism 
is the kernel of / as a homomorphism of Abelian groups; that is ker/ = {x ^ R: f{x) = 0}. 

If f~^ exists and is also a ring homomorphism, then / is an ISOMORPHISM from R to S. If an 
isomorphism from R to S exists, then R and S are ISOMORPHIC. 
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4.7.2. Definition. A map f : A ^ B between algebras is an (algebra) homomorphism if it 
is a linear map between A and B as vector spaces which preserves multiplication (in the sense of 
equation (4.6). In other words, an algebra homomorphism is a linear ring homomorphism. It is 
a UNITAL (algebra) HOMOMORPHISM if it preserves identities (as in (4.7)). The kernel of an 
algebra homomorphism / : A — )• S is, of course, {a £ A: f{a) = 0}. 

If exists and is also an algebra homomorphism, then / is an isomorphism from A to B. 
If an isomorphism from A to B exists, then A and B are isomorphic. 

Here are four essentially obvious facts about algebra homomorphisms. 

4.7.3. Proposition. Every bijective algebra (or ring) homomorphism is an isomorphism. 

4.7.4. Proposition. If f : A ^ B is an isomorphism between algebras (or rings) and A is unital, 
then so is B and f is a unital homomorphism. 

4.7.5. Proposition. Let A, B, and C be algebras (or rings). If f : A ^ B and g: B ^ C are 

homomorphisms, so is gf: A ^ C. (As is the case with group homomorphism and linear maps, gf 
denotes the composite function g o f.) If f and g are unital, so is gf. 

4.7.6. Proposition. If f : A ^ B is a bijection, A is an algebra, and B is any set, then we may 
equip B with algebraic operations in such a way that B is an algebra isomorphic to A. 

4.7.7. Remark. It is clear that the preceding proposition remains true if we substitute for the 
word "algebra" the name of nearly any type of algebraic object — semigroup, group, ring, field, 
vector space. It is worth remarking on a slight variation of this proposition, which can also be quite 
useful. 

Suppose that f : A ^ B is a bijection where A is one of the algebraic structures we have dis- 
cussed. For concreteness let's take A to be an algebra. Suppose further that i? is a set equipped 
with corresponding algebraic operations (in this case, addition, multiplication, and scalar multi- 
plication) and that / is operation preserving. Then B is itself an algebra and / is an (algebra) 
isomorphism. 

4.7.8. Example. The algebras M„ (see 4.6.10) and i2(]R") (see 4.4.5 and 4.6.11) are isomorphic. If 
you found it irksome to prove that matrix multiplication was associative in example 4.6.10, explain 
how to use remark 4.7.7 to simplify that proof. 

4.7.9. Example. Let 5 be a nonempty set. Notice that each element of J-"(S', Z2) is "essentially" 
a characteristic function on S except that it is Z2 valued rather than real valued. Thus we will 
denote a member / of J-'(5, Z2) by if ^ = /^({l})- The map 

is an isomorphism between the Boolean rings ^{S) and J-"(S', Z2) (see examples 4.6.24 and 4.6.25). 
If you found it clumsy to write a proof of the associative property for symmetric difference in 
proposition 1.6.4(a), use remark 4.7.7 to write a neater one. 
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PARTIALLY ORDERED SETS 

5.1. Partial and Linear Orderings 

5.1.1. Definition. Let < be a relation on a nonempty set S. 

(a) If the relation < is reflexive and transitive, it is a preordering. 

(b) If < is a preordering and is also antisymmetric, it is a partial ordering. 

(c) Elements x and ?/ in a preordered set are comparable if either x < y or y < x. 

(d) If < is a partial ordering with respect to which any two elements are comparable, it is a 
linear ordering (or a total ordering). 

(e) If the relation < is a preordering (respectively, partial ordering, linear ordering) on S, then 
the pair {S, <) is a preordered set (respectively, partially ordered set, linearly 
ordered set). 

(f) A linearly ordered subset of a partially ordered set (5, <) is a CHAIN in S. 

We may write 6 > a as a substitute for a < b. The notation a < 6 ( or equivalently, b > a) means 
that a < b and a ^b. 

5.1.2. Example. The real line M is a linearly ordered set under its usual ordering <. 

5.1.3. CAUTION. The most important of the preceding concepts is partial ordering. The nota- 
tion < for partial orderings tends to be somewhat misleading. Since the usual < relation on the 
real numbers is a linear ordering it is an easy (and disastrous) mistake to treat a general partial 
ordering as a linear ordering, which usually it is not. Keep firmly in mind that two elements of a 
partially ordered set need not be comparable. A space of functions (under pointwise ordering, see 
example 5.L8 below) is a much more reliable mental picture of a partially ordered set than the set 
M of real numbers. 

5.1.4. Example. On define a relation < by 

(a, b) ■< (c, d) if a < c. 
Then ^ is a preordering which is not a partial ordering. 

5.1.5. Example. On M? define COORDINATEWISE ordering ■< by 

(a, b) < (c, d) if a < c and b < d. 
Coordinatewise ordering is a partial ordering on which is not a linear ordering. 

5.1.6. Example. On define lexicographic ordering < by 

(a, b) :< (c, d) if either a < c or else a = c and b < d. 

Lexicographic ordering is a linear ordering on M^. (Where do you think the name lexicographic 
ordering comes from? Hint: look up the word "lexicographic" in a dictionary.) 

5.1.7. Example. Let S" be a set which contains at least two elements. Then set inclusion is a 
partial ordering on the power set ^{S) which is not a linear ordering. (Why did we require S to 
have at least two elements?) 

5.1.8. Example. If 5 is a nonempty set, then J^{S) = J^{S,R) can be given a pointwise order- 
ing by setting f < g '\i f{s) < g{s) for every s £ S. Under this ordering J-^{S) is a partially ordered 
set. If S contains more than one point, the ordering is not linear. 
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5.2. Infima and Suprema 

5.2.1. Definition. Let < be a partial ordering on a set S and A Q S. An element Z € S is a 
LOWER BOUND for A if /< a for all a £ A. In this case we also say that A is bounded below (or 
minorized) by I. Similarly, an element u G S* is an upper bound for ^ if « > a for ah a G A. In 
this case we also say that A is bounded above (or majorized or dominated) by u. 

The element I is the greatest lower bound (or infimum) of A if 

(i) I is a lower bound for A, and 

(ii) if m is any lower bound for A, then / > m. 

In general we denote the infimum of a set A by either inf j4 or /\A. When A = {ai, . . . ,a„} is 
finite we usually write oi A 02 A • • • A a„ or Afc=i for /\A. U A = {ai, 02, • • • } is denumerable, 
we may write AfcLi for A ^• 

Similarly, the element / is the least upper bound (or supremum) of yl if / is an upper bound 
for A, and if whenever M is any upper bound for A, then I < M. We denote the supremum of a 
set A by either sup A or V ^- When A = {oi, . . . , o„} is finite we usually write oi V 02 V • • • V a„ 
or Vfc=i ^k foi' V ^- If ^ = {oi, 02, • • • } is denumerable, we may write Vfcli ^k for V ^• 

The "sup" in "sup A" is pronounced like the English word "soup" not like the English word 
"sup" (to dine). 

The plural of "infimum" is "infima" ; the plural of "supremum" is "suprema" . 

A subset of a partially ordered set may have many upper bounds, but the definite article in the 
definition of "supremum" requires justification. 

5.2.2. Proposition. A subset of a partially ordered set can have at most one supremum and at 
most one infimum. 

5.2.3. Proposition. Let A and B he nonempty subsets of a partially ordered set S with B (1 A. 
If both A and B have least upper bounds, then supB < sup A. Also, if both A and B have greatest 
lower bounds, then ini B > ini A. 

5.2.4. Definition. Let < be a partial ordering on a set S and A <^ S. An element m G vl is a 
maximal element of A if there is no element of A strictly larger than m; that is, if a = m whenever 
a & A and a > m. Similarly, m G ^ is a MINIMAL element of A if a = m whenever a ^ A and 
a < m. An element m G A is the largest (or greatest) element of ^ if m > a for all a e A. It 
is the SMALLEST (or least) element of ^ if m < a for all a G A. 

Clearly if m is the largest element of a set A it is a maximal element of A; in fact, it is the one 
and only maximal element of A. 

5.2.5. Example. Let S = {a, b, c, d} and Q be the family of all proper subsets of S. Then {a, b, c}, 
{a,b,d}, {a,c,d}, and {b,c,d} are the maximal elements of Q, but Q has no largest element. 

5.2.6. Definition. A partially ordered set is order complete if every nonempty subset has 
a supremum and an infimum. It is Dedekind complete if every nonempty subset which has 
an upper bound has a least upper bound. Clearly every order complete partially ordered set is 
Dedekind complete. Occasionally it is useful to have one more closely related concept. A partially 
ordered set is cj-Dedekind complete (or Dedekind cj-complete or countably Dedekind 
complete) if every nonempty countable subset which is bounded above has a least upper bound. 
(The prefix "u" is the Greek letter "sigma". It usually denotes something to do with countability.) 

5.2.7. Example. The power set of a nonempty set (ordered by set inclusion) is order complete. 

5.2.8. Example. The set of real numbers (under its usual ordering) is Dedekind complete (this is 
just the least upper bound axiom — see 6.1.6) but not order complete. 

5.2.9. Proposition. If S is a Dedekind complete partially ordered set, then every nonempty subset 
of S which has a lower bound has a greatest lower bound. 
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Hint for proof . Suppose S is a nonempty subset of S which is bounded below. Notice that the 
set of all lower bounds of B is bounded above. 

5.2.10. Definition. A function f : S ^ T between preordered sets is order preserving if 
f{x) < f{y) in T whenever x < y m. S. It is order reversing if f{x) > f{y) in T whenever 
X < y m. S. We say that / is an order isomorphism if it is bijective and x < y in S* if and only 
if f{x) < f{y) in T. 

5.2.11. Exercise. Give an example to show that a bijective order preserving map between partially 
ordered sets need not be an order isomorphism. 

It is helpful to have a mental picture of objects with which one is working. Because of our 
familiarity with the ordering of the real line it may be tempting to use it as a visual model for a 
typical partially ordered set. In many situations this turns out to be a bad idea. The fact that 
the line is linearly ordered makes it too special to be a good guide. A much better visualization 
is a standard garden trellis or the family of continuous functions on the unit interval [0, 1] (with 
pointwise ordering — see example 5.1.8). The next exercise shows the sort of thing that can go 
wrong when M is used as a visual model for a partially ordered set. 

5.2.12. Exercise. Your good friend Fred R. Dimm is trying to prove a result about a partially 
ordered set S. He is able to show (correctly) that for a particular subset A of S, there exists an 
element c G S such that c < sup A. He then concludes from this that there must be at least one 
element a of A such that c < a. Help Fred by finding a concrete example where this conclusion is 
false. 

5.3. Zorn's Lemma 

An axiom of set theory which is equivalent to the axiom of choice (see axiom 2.10.5 and 2.10.6) 
is the curiously named Zorn's lemma. 

5.3.1. Axiom (Zorn's lemma). A partially ordered set in which every chain has an upper bound 
has a maximal element. 

One standard and elementary use of Zorn's lemma is to show that every vector space has a basis. 
Before proving this result we give a little background, with which you may be already familiar. 

5.3.2. Definition. A vector y is a linear combination of the (distinct) vectors xi, . . . , x„ if 
there exist scalars qi, ... a™ such that y = X^^^^ ctfcXfc. Note: linear combinations are finite sums. 
The linear combination X]fc=i '^k^k is TRIVIAL if all the coefficients ai, . . .an are zero. If at least 
one Ofc is different from zero, the linear combination is NONTRIVIAL. 

5.3.3. Example. In the vector (8, 2) is a linear combination of the vectors (1, 1) and (1, —1). 

5.3.4. Example. In the vector (1,2,3) is not a linear combination of the vectors (1,1,0) and 
(1,-1,0). 

5.3.5. Definition. Suppose that A is a subset (finite or not) of a vector space V . The span of A 
is the set of all linear combinations of elements of A. For a nonempty set A there is another way of 
saying the same thing: the span of A is the smallest subspace of V which contains A. We denote 
the span of A by span A. \iU = span A, we say that A spans U or that U is spanned by A. 

5.3.6. Remark. There are many instances in mathematics when we have a set A of elements 
belonging to some sort of object and we say that "j4 generates a doohickey of type X" . Alternatively, 
we speak of, "the smallest doohickey of type X which contains A" . There are frequently two ways 
of defining the appropriate doohickey — a constructive way and, for the lack of a better term, an 
abstract way. The constructive way gives you a more or less explicit recipe for constructing the 
desired doohickey from the elements of A. The abstract approach observes that the intersection 
of a family of doohickeys of type X is again a doohickey of type X, and so the smallest type X 
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doohickey containing A is the intersection of all the type X doohickeys which contain A. It turns 
out to be very convenient to have both of these approaches available. But, unfortunately, it is not 
always possible. Whenever we do have both approaches to the "doohickey generated by j4", it is 
important to prove that they are equivalent, that is, to show that the two approaches generate 
exactly the same thing. The next exercise 5.3.7) gives a concrete example of how this works in 
practice. 

5.3.7. Exercise. Show that the assertion made in the preceding definition makes sense and is 
correct. That is, show first that there is a "smallest" subspace containing a given nonempty set 
A of vectors. {Hint. Try the intersection of the family of all subspaces containing A.) Then show 
that the "constructive" definition of the span of A (the set of all linear combinations of elements 
of A) is the same as the "abstract" characterization (the smallest subspace containing A). 

5.3.8. Remark. Occasionally one must consider the trivial question of what is meant by the 
span of the empty set. According to the "abstract" definition above it is the intersection of all the 
subspaces which contain the empty set. That is, span0 = {0}. (Had we adopted the "constructive" 
definition, the set of all linear combinations of elements in 0, the span of the empty set would have 
been just itself.) 

5.3.9. Example. For each n = 0, 1, 2, . . . define a function p„ on M by Pn(x) = x". Let V be the 
set of polynomial functions on M. It is a subspace of the vector space of continuous functions on M. 
Then V = spanjpo, pi, p2 • • • }• The sine function, whose value at x is sinx, is not in the span of 
the set {po,pi,P2 ■■■}■ 

5.3.10. Definition. A subset A (finite or not) of a vector space is linearly dependent if the 
zero vector can be written as a nontrivial linear combination of elements of A; that is, if there 
exist vectors xi, . . . , x„ S A and scalars ai, . . . , a^, not all zero, such that Y12=i (^k^k = 0. A 
subset of a vector space is linearly independent if it is not linearly dependent. 

Technically, it is a set of vectors that is linearly dependent or independent. Nevertheless, these 
terms are frequently used as if they were properties of the vectors themselves. For instance, if 
S = {xi, . . . , x„} is a finite set of vectors in a vector space, you may see the assertions "the set S is 
linearly independent" and "the vectors xi , . . . x„ are linearly independent" used interchangeably. 

5.3.11. Example. For each n = 0, 1, 2, . . . define a function pn on M by Pn{x) = x"". Then the set 
{po,Pi,P2, . . . } is a linearly independent subset of the vector space of continuous functions on M. 

5.3.12. Definition. A set B of vectors in a vector space is a (Hamel) basis for V if it is 
linearly independent and spans V. For Hilbert spaces we will encounter another type of basis, the 
orthonormal basis (see definition 28.1.14). For such spaces it must be made clear whether we are 
talking about a Hamel basis or an orthonormal basis. In the context of vector spaces, however, 
only Hamel bases make sense; so the modifier "Hamel" is usually omitted. 

In a vector space V subsets of linearly independent sets are themselves linearly independent; 
and supersets of spanning sets are spanning sets. In order for a linearly independent set to be a 
basis for V it must be "large enough" to span V; and in order for a spanning set to be a basis for V 
it must be "small enough" to be linearly independent. Thus in order for a set i3 to be a basis for V 
it must be "just the right size". We formalize this loose language in propositions 5.3.13 and 5.3.17. 

5.3.13. Proposition. Let V be a nonzero vector space and B be a nonempty subset of V . Then 
the following assertions are equivalent. 

(a) B is a maximal linearly independent subset of V. 

(b) B is a minimal spanning set for V. 

(c) B is a basis for V . 

5.3.14. Proposition. Every vector space has a basis. 
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Proof. First, notice that the empty set is a basis for the zero vector space {0}. Reason: is 
hnearly independent and its span is {0} (see remark 5.3.8). 

If y is a nonzero vector space, let £ be the set of all linearly independent subsets of V. Partially 
order this set by inclusion. The union of any chain in £ is itself linearly independent (see the next 
exercise) and is therefore an upper bound in £ for the chain. Thus according to Zorn's lemma £ 
has a maximal element, and by proposition 5.3.13 a maximal linearly independent subset of V is a 
basis for V. □ 

5.3.15. Exercise. Verify the assertion made in the preceding proof that the union of any chain of 
linearly independent subsets of a vector space is itself linearly independent. 

5.3.16. Exercise. Modify the proof of proposition 5.3.14 to prove that if A is a nonempty linearly 
independent subset of a vector space V, then there exists a basis for V which contains A. 

Hamel bases are not of great utility if they are not finite. As a consequence many linear algebra 
texts prove only that if one basis for a vector space is finite then so is any other basis and both will 
have the same cardinal number. This turns out to be true whether or not the space has a finite 
basis. 

5.3.17. Proposition. Any two bases for a vector space are cardinally equivalent. 

The proof of this general result is slightly technical. If you are interested, proofs can be 
found in [35] (theorem 1.12), [23] (theorem 4.58), [9] (page 11, theorem 2.12), or [24] (page 185, 
theorem 2.7). 

5.3.18. Definition. If a vector space V has a finite basis B, then V is said to be finite dimen- 
sional and the cardinality of B is the dimension of V. If B is infinite, then V is an infinite 
DIMENSIONAL vector space. Depending on the cardinality of B we may distinguish between count- 
ably infinite and uncountably infinite dimensional spaces. 

5.3.19. Example. The so-called standard basis vectors (1,0,0), (0,1,0), and (0,0,1) (see 4.5.8) 
constitute a (Hamel) basis for the vector space M'^. So is a 3-dimensional vector space. 

5.3.20. Example. The set {po,Pi,P2, ■ ■ ■} of polynomials defined in example 5.3.11 is a basis for 
the vector space V of polynomial functions on M. So P is a (countably) infinite dimensional vector 
space. 

5.3.21. Proposition. Every proper ideal in a unital algebra A is contained in a maximal ideal. 
Thus, in particular, if A ^ 0, then Max^ is not empty. 

5.4. Lattices 

5.4.1. Definition. A partially ordered set {S,<) is a lattice if each pair of elements has both 
an infimum and a supremum in S. 

5.4.2. Example. Any linearly ordered set is a lattice. If a < 6, then a Ab = a and a\/ b = b. In 
particular, any nonempty subset of M (under its usual ordering) is a lattice. 

5.4.3. Example. We observed in example 5.2.7 that if 5* is a nonempty set, then ^{S) is an order 
complete lattice under the partial ordering of set inclusion C. The infimum of two members of 
^(5) is their intersection; the supremum is their union. 

5.4.4. Example. Under coordinatewise ordering (see example 5.1.5) is a lattice. We have 
(a, b) A (c, d) = {a Ac,bAd) and (a, b) V (c, d) = (a V c, 6 V d) for all a, b, c, d € M. 

5.4.5. Example. Let S" be a nonempty set. Then the family J-'{S) of real valued functions on S 
under pointwise ordering (see example 5.1.8) is a lattice. For all f,gG J^{S) we have 

fAg = l[f + g-\f-g\] 

and 

/V5 = ^[/ + 5 + l/-5|]. 
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Hint for proof . The verification of tfiese identities is immediate from two (fairly obvious) prop- 
erties of the real numbers (see 6.4.1). 

We regard A and V as binary operations on a lattice L. That is, 

A:LxL-^L: {a,b) t-^ a Ab 

and 

V: L X L ^ L: {a,b) aV b . 
Lattice theorists often call the operation A and V, meet and join, respectively. 

5.4.6. Proposition. In a lattice the operations A and V are associative and commutative. 

5.4.7. Example. If m, n G N we say that m divides n and write m\n if there exists p G N such 
that pm = n. The relation | is a partial ordering on N. Under this partial ordering N is a lattice. 
What are the standard names for m An and my nl 

5.4.8. Proposition. If a and b are elements of a lattice, then the following are equivalent: 

(i) a < b; 

(ii) a Ab = a; and 

(iii) a\/b = b 

5.4.9. Proposition. If a <b and c < d in some lattice, then ay c< by d and a Ac < b Ad. 

5.4.10. Notation. If A and B are subsets of a lattice, then AyB denotes {aV6: a G A and b G B}; 
we write Ay b for Ay {b}. The definitions of A A B and A Ab are similar. 

5.4.11. Proposition. Let A and B be subsets of a lattice. If sup A and sup B exist, thensup{AyB) 
exists and 

sup{A y B) = (sup A) V (sup B) . 
Similarly, if inf A and ini B exist, then mf(A A B) exists and 

mi{A AB) = {mi A) A (inf 5) . 

5.4.12. Proposition. In a lattice every pair of elements a and b satisfies the laws of absorption 

a A {ay b) = a and ay {a Ab) = a . 

5.4.13. Definition. A lattice L is distributive if 

a V (6 A c) = (a V 6) A (a V c) 

and 

a A (6 V c) = (a A 6) V (a A c) . 

5.4.14. Example. If S" is a nonempty set, then ^{S) (partially ordered by inclusion) is a distribu- 
tive lattice. 

Of course, not every lattice is distributive. 

5.4.15. Example. On the set S = {l,a,b,c,m} define a partial order < so that I < a < m, 
I < b < m, I < c < m, and no two of the elements in the set {a, b, c} are comparable. Then {S, <) 
is a lattice but is not distributive. {Proof: a A {by c) = a while {a A b) y {a A c) = I.) 
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5.5. Lattice Homomorphisms 

5.5.1. Definition. A function f : L ^ M between lattices is a lattice HOMOMORPHISM if f{x V 
y) = f{x) V f{y) and f{x Ay) = f{x) A f{y) for all x, y £ L. 

5.5.2. Proposition. Every lattice homomorphism f : L ^ M is order preserving. 
The converse of this proposition does not hold. 

5.5.3. Example. Let S = {0, 1} and L be the lattice ^{S) (see example 5.4.3). Define a partial 
ordering on the set M = {a, b} by setting a < b. Clearly M is also a lattice. The function 
f : L ^ M defined by i— )• a, {0} i— )• a, {1} i— )• a, and 5" i— )• 6 is order preserving but is not a lattice 
homomorphism. 

5.5.4. Definition. A point c in a set 5 is a fixed point of a function /: 5 — > S" if /(c) = c. 

5.5.5. Proposition (Knaster-Tarski fixed point theorem). Every order preserving map from an 
order complete lattice into itself has a fixed point. 

Hint for proof . Let L be an order complete lattice, </>: L — )• L be order preserving, and s be 
the supremum of {x G L: x < (j){x)}. Show that </>(s) = s. 

5.5.6. Proposition (Banach Decomposition Theorem). // S and T are sets, f:S — t- T, and 

g: T — )■ S", then there exist subsets A of S and B of T such that f^{A) = B and g^{B^) = AS- . 

Hint for proof . Recall that we have seen previously (in example 5.4.3) that partially ordered 
by ^ the set ^(5) is an order complete lattice. Corresponding to the function / : 5 — )■ T is the 
mapping 

f^:^{S)^^[T): A^{f{a):a£A} 

which takes each subset S to its image under / in T . We also define the mapping 

C5:q3(5)^q}(5): yl^^'^ 

which takes each subset of S to its complement. The maps g~^ and Ct are defined similarly. Now 
consider the composite function (f) defined by 

(?:>:= o 5^ o Ct o . 

Use the preceding proposition. 

5.5.7. Theorem (Cantor-Schroder-Bernstein Theorem). If S andT are sets and both f : S ^ T 
and g: T ^ S are injective, then there exists a bijection h: 5" — t- T. 

5.5.8. Definition. We will now extend the notion of cardinality introduced in 3.1.3 to general 
sets. We will say that sets A and B have the same cardinality, and will write card A = cardi?, if 
there exists a bijection from A onto B. And we will say that the cardinality of A is less than or 
equal to the cardinality of B, and write card A < cardi?, if there exists an injective map from A 
into B. Theorem 5.5.7 provides us with the appropriate antisymmetry result. 

5.5.9. Proposition. Let A and B be sets. If card A < caidB and cardB < card A, then carded = 
cardi?. 

Thus it is clear that on any given family of sets cardinality induces an equivalence relation. 

5.6. Boolean Algebras 

5.6.1. Definition. A function f : S ^ S from a set to itself is an involution if it is its own 

inverse; that is if /(/(s)) = s for all s G S*. 

5.6.2. Definition. A boolean algebra is a nonempty distributive lattice L together with an 
involution a i— t- a' on L such that for all a, b £ L 

(a) if a < 6, then b' < a' , and 
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(b) a A a' < 6 < a V a'. 

The element a' is often called the complement of a. Notice that condition (b) says that (for any 
element a whatever) a A a' is the smallest element of a Boolean algebra; we denote it by 0. Similarly, 
a V a' is the largest element of the algebra; it is denoted by 1 . If in a Boolean algebra = 1, then 
the algebra consists of a single element. This is the trivial Boolean algebra. Any other Boolean 
algebra is nontrivial. 

5.6.3. Proposition. In a Boolean algebra the mapping a' is hath infective and surjective. The 
converse of condition (a) above holds in any Boolean algebra. 

5.6.4. Example. The set Z2 = {0, 1} is the smallest nontrivial Boolean algebra. Here the ordering 
< 1 and complements 0' = 1 are just what you would expect. 

5.6.5. Example. In example 5.4.14 we saw that the power set ^{S) of a set 5" is a lattice. Under 
the involution A which takes a subset of S to its complement (in S) , the power set becomes 
a Boolean algebra. The element is the empty set and the element 1 is 5" itself. 

5.6.6. Example. Let S be the set of sentences which make up the so-called prepositional calculus. 
For our purposes all we need to know (or assume) about the propositional calculus is that it is a 
set of sentences closed under the usual logical operations: A ("and"), V ("or"), ~ ("not"), =^ 

( "implies" ) , and <;=^ ( "if and only if" or "logical equivalence" ) . A good place to find background 
material on the propositional calculus is the beautiful, slim, and readable book by Halmos and 
Givant, Logic as Algebra [ ]. 

It is clear that =^ is a preordering on S but not a partial ordering. We remedy this 
situation by using a quotient construction. The operation <^=^ of logical equivalence is indeed an 
equivalence relation on the set S of sentences. The resulting equivalence classes, which partition S 
(see proposition 2.11.7), we will call propositions. As we saw in section 2.11 the set P = S/<^^ 
of propositions (equivalence classes) is the quotient of S modulo the relation <;=^ . As usual, we 
denote the proposition containing a sentence s by [s]. The relation =^ on S induces a relation, 
which we also denote by =^ , on P in the obvious fashion: [s] =^ [t] whenever s =^ t. This 
relation on P is clearly well-defined since it does not depend on the choice of representatives from 
the equivalence classes (see the discussion just before exercise 2.11.10); that is, if s =^ t, if s' is 
logically equivalent to s, and if t' is logically equivalent to t, then s' =^ t' . It is quite clear that 
the new relation =^ on P is not only a preordering but in fact a partial ordering. (If [s] =^ [t] 
and [t] =^ [s], then s =^ t and t =^ s, so that s <^=^ t. Since s and t are equivalent, 
[s] = [t].) 

In a similar fashion the other logical connectives ~ ("not"), A ("and"), and V ("or") may be 
unambiguously extended to the set P of propositions. Then, under the partial ordering =^ , the 
proposition pAq really is the infimum of the propositions p and q, and pV q really is the supremum 
of p and q. This tells us that {P, =^ ) is a lattice. It is not difficult to check that it is in fact a 
distributive lattice. Furthermore, with the addition of ~ ("not") as an involution on P, the set of 
propositions becomes a Boolean algebra. The element of this Boolean algebra is the equivalence 
class of all self-contradictory sentences — sometimes denoted by F (for false); and the 1 element is 
the set of all tautologies — sometimes denote by T (for true). 

Hint for proof . There are quite a few details to be verified. To avoid confusion in the crucial 
arguments that p A q really is the infimum of p and q and that p\/ q really is the supremum of p 
and q be careful with notation: for example, use p Aq to mean "p and q" and use mi{p, q} for the 
greatest lower bound of p and q. Since the point of the argument is to prove that these two are the 
same, it would not be a good idea initially to use the same notation for both. 

The power set of a set and the propositional calculus are "good" examples of Boolean algebras 
in the sense that many familiar results from set theory and logic turn out to be true in general 
Boolean algebras. Here is the Boolean algebra version of De Morgan's laws. 
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5.6.7. Proposition. Let S be a Boolean algebra with partial ordering < and involution a i— )• a' . 
Then 

(a) (a V 6)' = a' A 6' 
and, dually, 

(b) (aA6)' = a'v6' 
for all a, b G S. 

Definition 5.6.2 is one standard approach to Boolean algebras. Another, equivalent, way is to 
specify that a Boolean algebra is a set S together with two binary operations A and V, which are 
associative and commutative, an involution s i— t- s'^, and two distinguished elements and 1 such 
that the following conditions are satisfied: 

(a) A distributes over V, and V distributes over A, 

(b) s V (s A t) = s A (s V t) = s for all s, t e S, and 

(c) s A = and s V = 1 for all s £ S. 

5.6.8. Exercise. Prove that the definition in the preceding paragraph is equivalent to the one 
given in definition 5.6.2. 

A third approach to Boolean algebras is through Boolean rings (see definition 4.6.22). In fact, 
a Boolean algebra is nothing more than a unital Boolean ring. Do the next two exercises to see 
how this comes about. 

5.6.9. Exercise. Let (i?, +, •) be a unital Boolean ring with multiplicative identity 1. For all a, 
b £ Rlet a < b ab = a and let a' = 1 + a. Then < is a partial ordering on R and the map a' 
is an involution on R. Equipped with this partial ordering and involution the ring i? is a Boolean 
algebra. 

5.6.10. Exercise. Let i? be a Boolean algebra with partial ordering < and involution a i— )• a' . For 
all a, b G R define 

a + 6 := (a A 6') V (a A b) 

and 

a • b := a A b . 

Then (R, + , •) is a unital Boolean ring. 

5.6.11. Definition. "Algebra" is a seriously overworked term. Classically an algebra (or a 
field) of subsets of a set S is defined as a family 21 of subsets of S which is closed under the 
operations of union and complementation and which contains S. The next proposition shows that 
this makes 21 into a Boolean algebra — which helps explain why the term "algebra of sets" is used 
for 21 even though it is not an algebra (or a field, for that matter) in the usual algebraic sense. 
Notice that (by a trivial inductive argument) if ^i, . . . , An all belong to an algebra 21 of sets, then 
so does Ufc=i ^k- 

5.6.12. Proposition. Let 21 be an algebra of sets; that is, a nonempty family of subsets of a set S 
such that 

(a) A U B G 21 whenever A, B £ 21; and 

(b) G 21 whenever A G 21. 

Then the family 21 is closed under the binary operations of intersection, set subtraction, and sym- 
metric difference; it contains and S; and it is a Boolean algebra under set inclusion (as a partial 
ordering) and complementation (as involution). Note also that (21, A, n) is a unital Boolean ring. 

5.6.13. Proposition. Let S be a set and be a family of algebras of subsets of S. Then is 
an algebra of subsets of S. 
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5.6.14. Exercise. Let 5^ be a family of subsets of a nonempty set S. Explain in careful detail 
what we mean by "the smallest algebra of sets containing Prove that such a thing exists. (If 
you have trouble with this look back at remark 5.3.6.) 

5.6.15. Example. Let S be an infinite set and ^ = {{x}: x G 5}. The smallest algebra of sets 
containing 5 comprises all finite and cofinite subsets of S. (A subset yl of a set S is COFINITE if its 
complement is finite.) 
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THE REAL NUMBERS 

6.1. Axioms Defining the Real Numbers 

The real numbers are typically approached axiomatically. That is, the set M of real numbers is 
defined to be a set together with two operations and an ordering which satisfy a list of four conditions 
(also called axioms) we impose. (Naturally these conditions are not arbitrary inventions, but have 
a long and complicated history behind them.) This leads to three rather big questions: 

(1) Is there anything out there that actually satisfies these axioms? In other words, are the 
axioms consistent? 

(2) Why do we speak of the real numbers? How do we know that there are not two essentially 
different things that both satisfy the axioms? 

(3) How do we get all the standard "facts" about the real numbers from the axioms? 

We sketch an answer to the first question in section 6.2 by describing very briefly the process through 
which we can construct a model for the axioms, that is, we discuss the creation of a set theoretic 
object which satisfies the axioms. The second question we answer without proof: the axiom system 
for R is categorical; that is, there is essentially only one model for the axioms defining the reals. 
(Two models are "essentially" the same if they are isomorphic.) Finally, we will assume that the 
most elementary parts of an answer to the third question (the business we usually call arithmetic) 
have been adequately covered in earlier courses. 

6.1.1. Definition. Let M be a set, let + and • be binary operations on M, and PCM. The 
quadruple {R, +, ■,¥) is the system of real numbers if the axioms I-IV below are satisfied. 

6.1.2. Axiom (I). (M,+,-) is afield. 

6.1.3. Axiom (II). P is closed under both + and ■. 

6.1.4. Axiom (HI). € PPl (— P) and if x ^0, then x belongs to either ¥ or but not both. 

The set P is the positive cone in M. Its members are the positive real numbers. It is worth 
emphasizing that in these notes, a number x is positive if a; > 0. If we want x > 0, we say that x 
is strictly positive. Similar remarks hold for negative and strictly negative. In axiom (HI), as 
in 4.4.10, — P is the set of additive inverses of members of P. In subsequent chapters of these notes 
we will use the more common notation M"*" for P. Axiom (HI) is often called the law of trichotomy. 
It says that every real number x satisfies exactly one of three conditions: it is 0, or it is strictly 
positive (x > 0), or it is strictly negative {x < 0). 

Define an ordering on M by 

x < y if and only if y — x G P . 
We call this the ordering on M induced by the positive cone P. 

6.1.5. Proposition. The ordering < defined above is a linear ordering on M. 

6.1.6. Axiom (IV). Under the ordering < defined above, M is Dedekind complete. 

As was mentioned above we will take for granted the most elementary consequences of these 
axioms (grade school arithmetic through college algebra). For an elegant synopsis of this material 
written for grownups, see chapter 1 of Stromberg's An Introduction to Classical Real Analysis [^o]. 
Below are are few points that are perhaps worthy of review. 
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6.1.7. Proposition. The partial ordering < induced on M by its positive cone respects the algebraic 
operations on M in the sense that 

X + z < y + z whenever x < y 

and 

xz < yz whenever x < y and < z . 

6.1.8. Proposition. Let A be a nonempty subset o/M. If A has a greatest lower bound, then —A 
has a least upper bound and 

sup(-A) = -inf ^. 

If you wish you may skip the proof of this result for the moment. We wih see in proposition 9.1.7 
that it holds more generally in all ordered vector spaces. The next proposition also holds in an 
arbitrary ordered vector space (see proposition 9.1.5). 

6.1.9. Proposition. Let A and B be nonempty subsets o/M. If A and B both have least upper 
bounds, then so does A + B and 

sup{A + B) = sup A + sup B . 

6.1.10. Proposition. Let A and B be nonempty subsets o/P. If A and B both have least upper 
bounds, then so does AB = {ab: a £ A and b G B} and 

sup{AB) = (sup A)(supS) . 

6.1.11. Exercise. State and prove the principle of mathematical induction. 

Hint for proof. Here is a little background that you may find useful. First, make a definition: 
A collection J of real numbers is inductive if 

(i) 1 G J, and 

(ii) a; + 1 G J whenever x £ J. 

Then prove that the intersection of a family of inductive subsets of M is inductive. Use this to show 
that there exists a smallest inductive set. Call this the set of natural numbers and denote it by N. 
Now here is an elegant statement of the principle of mathematical induction: Every inductive subset 
of N equals N. (Is the proof of this obvious?) Use this to prove the statement of mathematical 
induction with which you are familiar. 

6.1.12. Proposition (The Archimedean Property of M). // a G M and e > 0, then there exists 
71 G N such that ne > a. 

The following important fact is an easy consequence of the Archimedean property of R. 

6.1.13. Corollary. For every pair a and b of real numbers with a < b there exists a rational number 
r such that a < r < b. 

6.2. Construction of the Real Numbers 

An important question to ask whenever we write down a definition (or, put diff'erently, a set of 
axioms) for some mathematical object is whether any such object actually exists. In other words, 
is the set of axioms consistent? This turns out in general to be too difficult to answer. The best we 
can ordinarily do is to show that a set of axioms (a definition) is relatively consistent. That is, we try 
to show that if some other bit of mathematics (usually set theory or category theory) is consistent, 
then so is the bit we are interested in. Thus confronted, for example, with the axioms defining 
the real numbers, one may well ask, "But is there anything that satisfies these axioms?" That is, 
"Do these axioms have a model? The answer in this case is a qualified yes; the axioms for the real 
numbers are relatively consistent — if we believe (or are willing to assume) that some set of axioms 
(Zermelo-Fraenkel, for instance) governing sets is consistent, then we can construct a set-theoretic 
model for the real numbers. For more information on the axioms for Zermelo-Fraenkel set theory 
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consult any text on axiomatic set theory or visit a good web site (for example, Mathworld [ ' '] or 
Wikipedia [46]) and search for something like "Zermelo-Fraenkel axioms." 

The construction of the real number system from sets is a fairly intricate (and, for some tastes, 
a rather tedious) business. In some universities a one-term or one-semester course is devoted to 
doing just that. Here we will simply outline a few of the major steps in the program, paying close 
attention to only one of them — the so-called Grothendieck construction, which produces an Abelian 
group from a commutative semigroup. The reason for singling out this one tool is its amazing 
usefulness in many fields of more advanced mathematics. 

First Steps. The process may be started by defining the natural numbers. We take the natural 
number 1 to be {0}. ^ For any natural number n already defined we define its SUCCESSOR S{n) 
to be {n}. We amalgamate all the objects 1, 5(1), ^(^(l)), ^(^(^(l))), and so on, into a set N. 
(Naturally, this intuitive amalgamation process requires for its justification an explicit set-theoretic 
axiom, usually called the axiom of infinity. For details on this consult [45] or [46].) Then we give 
these objects their conventional names: 2 := S{1), 3 := S{S{1)) = S{2), 4 := ^(^(^(l))) = 5(3), 
and so on. Thus we have the familiar set N = {1, 2, 3, 4, ... } of natural numbers. The successor 
function n i— )• 5(n) on this set gives us an obvious way of ordering this set: 1 < 2, 2 < 3, . . . . One 
verifies that this creates a linear ordering on N. One shows, in fact, that this is a well-ordering 
of N. (An ordered set is well-ordered if every nonempty subset of the set las a least element.) 

The next step is to create an arithmetic for the natural numbers by defining addition and 
multiplication on N. This is done inductively. For every m define 

m + 1 := 5(m) 

and, supposing that m + n has already been defined, let 

m + 5(n) := 5(m + n) . 

One now verifies that under this operation N is a commutative semigroup having the cancellation 
property. This is the point where we make our first use of the Grothendieck construction, which we 
now examine in detail. 

The Grothendieck Construction. Throughout this subsection (5, +) is a commutative semi- 
group. On the Cartesian product 5x5 define a relation ~ by declaring (a, b) ~ (c, d) if an only if 
there exists z £ S such that 

a + d + z = b + c + z. 

6.2.1. Proposition. The relation ~ is an equivalence relation on S x S. 

Our notation for the equivalence class containing the pair (a, b) would normally be [(a, b)]. For 
economy we will shorten it here to (a, b). Let G{S) be (5x5)/ ~, the quotient of 5 x 5 by ~. On 
G(5) define a binary operation (also denoted by +) by: 

(a, 6) + {c,d) := {a + c,b + d) . 

6.2.2. Proposition. The operation + defined above is well-defined. 

6.2.3. Proposition. Under the operation + defined above G{S) is an Abelian group. 
The resulting Abelian group is called the Grothendieck GROUP generated by 5. 

6.2.4. Definition. If 0: 5 — )• G is an injective homomorphism from a semigroup into a group (so 
that 5 is isomorphic to its image under (p), say that (f) is an (algebraic) embedding and that 5 

is EMBEDDED in G. 

Now for an arbitrary a £ S define a mapping 

4>: S ^ G{S) : s^ {a,a + s) . 



^Of course, we could start with :— but in these notes belongs to Z""" (the set of positive integers) but not 
to N (the set of natural numbers). 
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6.2.5. Proposition. The mapping (p above is well-defined. That is, the definition is independent 
of the choice of a. 

6.2.6. Proposition. The mapping (p is a semigroup homomorphism. 
This homomorphism is the Grothendieck map associated with S. 

6.2.7. Proposition. The Grothendieck mapping associated with a semigroup S is an embedding if 
and only if S has the cancellation property. 

6.2.8. Exercise. Take S to be N. Explain why it is reasonable to regard the resulting Abelian 
group as the group Z of integers. Explain the process for assigning old names (like 1, 2, 3, . . . ) to 
the new objects living in G(N). Then identify the objects in G(N) to which we assign new names 
(like 0, —1, —2, . . . ). Also explain how we can extend the ordering on N to a linear ordering on Z. 

6.2.1. The Rational Numbers. The point of the last subsection was to demonstrate how we 
can construct the Abelian group of integers from the commutative semigroup of natural numbers. 
Before proceeding to create the rational numbers it is convenient to define multiplication on the 
integers in such a way that they become a ring. One can use an inductive technique virtually 
identical to the one in the last paragraph of subsection 6.2 to define multiplication on N and then 
extend the operation to all of Z. Of course, it must be proved that these rather obvious definitions 
do indeed create a ring structure on Z. 

Now to produce the rational numbers we apply the Grothendieck technique to the commutative 
semigroup Z under multiplication — with one slight change. Instead of working with 'Lx'Lwe work 
with Z X Zq where Zq is the set of nonzero integers. As before say that two ordered pairs in Z x Zq 
are equivalent, {m,n) ~ {p,q), if there exists z G Z such that mqz = npz. The ~ turns out to 
be an equivalence relation on Z x Zq. Let Q be the family (Z x Zq)/ ~ of equivalence classes. 
We adopt familiar notation by designating the equivalence class containing the pair (m, n) by ^. 
Define multiplication on Q by 

m p mp 
n q nq 

and addition by 

m p mq + np 
n q nq 

Also define a map -0 : Z — t- Q : n i— )• j . 

6.2.9. Exercise. Explain how the presence of an identity element in the multiplicative semigroup 
7j allows us to simplify the definition of the equivalence relation ~ above. 

6.2.10. Proposition. Addition and multiplication are well-defined on Q, and under these opera- 
tions Q is a field. 

6.2.11. Proposition. The mapping Z — )• Q defined above is an embedding of the commuta- 
tive (multiplicative) semigroup Z into the Abelian (multiplicative) group Q. The mapping ip also 
preserves addition; that is, ip[m + n) = ^{m) + •i/^(n) for all m, n €z Z. 

6.2.12. Exercise. Explain how to extend the linear ordering on Z to a linear ordering on Q. 

6.2.2. From Q to M. Once we have the ordered field Q, it is just one more step to the reals. 
The central idea here is that Q becomes a metric space when we define the distance between to 
rational numbers r and s to be |r — s|, but this metric space is not complete. It turns out to be 
possible to complete an arbitrary metric space, and the completion of the rationals is the set M of 
real numbers. (We will define and discuss metric spaces in section 10.4. Complete spaces and the 
process for completing a space appear in section 20.2. Of particular interest in the present context 
is proposition 20.2.4.) 

Having produced the set M we next extend the operations of addition and multiplication and 
the linear ordering relation from Q to M. The final, but crucial, thing that must be proved is that 
the object we have so laboriously constructed really does satisfy the axioms for the real numbers 
given in section 6.1. 
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6.3. Elementary Functions 

In analysis the next logical step after developing the real number system is to derive in detail 
the most important properties of the elementary functions: the algebraic (including polynomial, 
rational, and power) functions, the trigonometric functions, the exponential functions, the inverses 
of all these (including radical, inverse trigonometric, and logarithmic functions), and all possible 
composites of the preceding. Pursuing this, I feel, would be far too lengthy a diversion from the 
principal subject material of these notes. So the standard properties of the elementary functions 
will, for the most part, simply be assumed. Here is a (very short) list indicating the sort of thing 
you may expect to see used without proof in the sequel. 

(a) The n^'^-root function x i— t- x^^^ is strictly increasing on the interval (0, oo) for every n G N. 

(b) The arctangent function is strictly increasing on M. Its derivative is (1 + x^)~^ for every x. 
It is concave (down) on the interval [0, oo). 

(c) The exponential function x i— )• is strictly increasing and convex (up) on R. 

(d) If In is the natural logarithm function on (0, oo) and x, y > 0, then Inxy = Inx + Iny. 

(e) arcsin | = vr/G. 

What should you do if you are nervous about agreeing to some (or all) of the assumptions 
made in these notes concerning real numbers and elementary functions? My advice is to consult 
Karl Stromberg's amazing text, An Introduction to Classical Real Analysis [ ]. It is a beautifully 
written, nearly flawless, essentially self-contained introduction to, among other topics, the real (and 
complex) number systems and the elementary functions. 



6.4. Absolute Value 

There are a few facts about the absolute value function which are used so frequently that it seems 
a good idea to write them down explicitly. In section 6.3 we avoided a discussion of the elementary 
functions by agreeing to assume knowledge of their standard properties. If, in particular, we are 
familiar with the square root function ^/ : [0, oo) — )• M, then it makes sense to define as Vx^ 
for each real number x. It is clear that if x > 0, then |x| = x; and if x < 0, then |x| = —x. The 
next exercise is quite simple to prove and parts of it crop up so often that they will usually be used 
without explicit reference. 

6.4.1. Exercise. If a, 6 G M, then 

(a) |a6| = \a\ \b\; 

(b) |a + 6| < \a\ + (the triangle inequality) 

(c) I \a\ — |6| I < |a — 6|; and 

(d) \ab\ < lia^ + b^). 

Hint for proof of (d). Consider the squares of a — 5 and a + b. 

It is interesting and somewhat surprising that the "algebraic" function absolute value and the 
"order" operations sup and inf are in a sense interchangeable. If we have one, we get the other. 
Much use is made of these relationships in the setting of Riesz spaces (beginning in chapter 14). 

6.4.2. Proposition. If x is a real number, then 



It is also true that if x €M, then 



and 



x\ = xV (— x) = — (x A (— x)) . 

xV y = l{x + y + \x - y\) 
xAy = ^(x + y — |x — y|). 



(6.1) 

(6.2) 
(6.3) 
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6.5. Some Useful Inequalities 

Inequalities involving real numbers are often essential ingredients of proofs and derivations in 
nearly every field of mathematics. In this section we give a (very) few of the most useful. The first 
is known as the Schwarz inequality (or as the Cauchy inequality, or as the Bunyakovskii inequality, 
or as any combination of the preceding). 

6.5.1. Proposition (Schwarz's Inequality). Let ui, . . . , Un, ui, . . . , fn G M. Then 

2 



k=l ^ ^k=l ^ ^fc=l ^ 



Proof. To simplify notation make some abbreviations: let a = ^ ui? , 6 = V}?' , and 

fc=l k=\ 

n 

c= Yj ^kVk- Then 

k=l 



2 



k=l 

ah - 2(? + (? 

ah-(? . □ 



A consequence of Schwarz's inequality is the Minkowski inequality. 
6.5.2. Proposition (Minkowski's Inequality). Let ui, . . . , Un,vi, . . . , f„ G M. Then 



1 . r, ,1 



V; — 1 / — 1 / Vt — 1 / 



Proof. Let a, b, and c be as in proposition 6.5.1. Then 

n 

^(ufc + Vk)"^ = a + 2c + b 
k=l 

<a + 2\c\ + b 



<a + 2Vab + b (by 6.5.1) 

= {^+Vbf. □ 
Before we state a version of Bernoulli's inequality 6.5.5 we develop some background. 

6.5.3. Definition. If n is a positive integer we define n! (read "n factorial) inductively by 
! := 1 and n! := n(n — 1)! whenever n > 1. If A; and n are positive integers and k < n, then the 
BINOMIAL coefficient (^) is defined by 

n\ n! 
^k) k\{n-ky.' 

The coefficient (^) is read "n by A;" or "n choose k" . 

6.5.4. Lemma. If k and n are positive integers and 1 < k < n, then 

n \ / n\ /n+1 

k-l) ^ \k) ^ \ k 

6.5.5. Proposition (Binomial Theorem). If a and b are real numbers and n is a positive integer, 
then 



A;=0 ^ ^ 
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6.5.6. Exercise. Find the sum of the binomial coefficients (^) from A; = to A; = n. 
Hint. 1 + 1 = 2. 

6.5.7. Proposition (Bernoulli's Inequality). Let x G M and n G N. If x > and n > 1, then 

(1 + x)" > 1 + nx. 

6.5.8. Exercise. Use the Schwarz inequality to prove part (a) and part (a) to prove part (b). 

(a) If ai, a2, . . . , a„ are strictly positive real numbers, then 

^k=l ^ ^k=l 

(b) If a, b, and c are strictly positive real numbers such that a + 6 + c = 1, then 

6.6. Complex Numbers 

Later in these notes we will encounter spaces of complex valued functions. The complex numbers 
are numbers of the form a + hi where a and b are real numbers and i is a complex number whose 
square is —1. The arithmetic of the complex numbers is easy. Addition is defined in the obvious 
way 

(a + hi) + (c + di) := (a + c) + (6 + d)i 

and so is multiplication 

(a + hi)[c + di) := {ac — bd) + {ad + bc)i . 

It is routine to verify that under these operations the set C = {a + bi : a, b £ M} is a field. 

Let z = a + bihe a complex number (with a, b £M). Then a is the real part of z and b is its 
IMAGINARY PART. The ABSOLUTE VALUE of z is Va?~+lp . The DISTANCE between two complex 
numbers z = a + bi and w = c + di is \z — w\. Another routine argument shows that this defines 
a metric (see section 10.4) on C. The map (a, b) a + bi is an isometry (distance preserving 
mapping) from (with its usual Euclidean distance onto the complex plane C. Thus the complex 
plane C can be viewed as just the usual Euclidean plane with additional arithmetic operations 
making it into a field. 

The only problem with the foregoing discussion is the second sentence of this section where 
the "complex numbers" are defined in terms of a particular "complex number" i. It is certainly 
reasonable to be deeply suspicious of this circularity. What would lead us to believe that there is 
any object whatever whose square is —1? There is certainly no real number with this property. If 
you don't already know the answer to this question and are interested in finding out what sort of 
"number" has square —1, read definition 6.6.1 below. Otherwise, it can be skipped. 

6.6.1. Definition. A complex number is an ordered pair of real numbers, that is, an element 
of . We define addition of complex numbers pointwise 

(a, b) + (c, d) := (a + c, 6 + d) 

and multiplication of complex numbers by 

(a, 6)(c, d) := (ac — bd, ad + be) . 

Notice that the map M — )• C: a i— )• (a, 0) is an injective homomorphism. The fields M and 
raiKp are isomorphic. We will use this isomorphism to identify the field of real numbers with 
the subfield of complex numbers of the form (a, 0). In this way every real number becomes a 
(special kind of) complex number. We will abuse notation and write a = {a, 0) whenever it seems 
convenient. 
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Next we will define i := (0,1). Notice that since = (0, 1)(0, 1) = (—1,0) = —1, we have 
succeeded in finding a complex number whose square is —1. Notice also that if a and b are real 
numbers, then 

a + bi = (a,0) + (6,0) (0,1) 
= (a,0) + (0,6) 
= {a,b). 

Thus a + hi is just another way of writing the complex number (a, b). Now the first paragraph of 
this section should make sense. 

6.6.2. Exercise. Show that the field C of complex numbers cannot be linearly ordered. That is, 

show that there is no subset P of C such that 

(i) the sets P, — P, and {0} are pairwise disjoint and their union is all of C; and 

(ii) P is closed under addition and multiplication. 



CHAPTER 7 



SEQUENCES AND INDEXED FAMILIES 



There are many, many ways of thinking about functions. And sometimes these ways of thinking 
about functions take on a Hfe of their own. We may find ourselves working with objects that are in 
fact functions without being conscious of the fact. Consider as an example a function x: N4 — > M. 
To completely specify the function we need to know its values at each point in its domain {1, 2, 3, 4}. 
So suppose we know the values x(l), x{2), x{3), and x(4). There is no harm in rewriting these 
numbers as xi, X2, x^, and X4. Since these values completely specify the function x it is common 
practice to say that the function is the list of these values. That is, x = {xi,X2,xs,X4). Thus 
our function has become an ordered quadruple of real numbers or, we may perhaps say, a point in 

Xl X2 

Euclidean 4-space. If, however, we arrange these four values to look like our function has 

[a;3 X4 

become a 2 x 2 matrix of real numbers. So in one sense a 2 x 2 matrix of real numbers is the same 
thing as a quadruple of real numbers: each is just a function from N4 into M. On the other hand 
they are normally regarded as quite different objects. Imagine how perplexed someone would be if 
you asked them to find the determinant of some ordered quadruple, or to calculate its trace. It is a 
mark, I think, of mathematical maturity to feel comfortable with mathematical objects no matter 
what disguise they choose to wear to a particular party. 



7.1. Sequences 

Everyone learns in elementary calculus (or before) to distinguish between the ordered pair (1, 2) 
and the set {1, 2}. The sets {1, 2} and {2, 1} are equal but the ordered pairs (1, 2) and (2, 1) are not. 
The ordered pair (3, 3) has two coordinates while the set {3, 3} has exactly one member. The same 
distinction must be made between a countable set {xi, X2, X3, . . .} and the sequence (xi, X2, X3, . . .). 
The first is a set and the second is a function. It is perverse, I think, in some elementary texts to 
carefully distinguish notationally between ordered pairs and sets (parentheses for pairs, braces for 
sets) and then use set notation {xi,X2, . . . } for sequences. 

7.1.1. Definition. A sequence is a function whose domain is the set N of natural numbers. If 

5 is a set then a function a: N — t- is called a sequence of elements of S. If a is a sequence, it 
is conventional to write its value at a natural number n as rather than as a{n). The sequence 
itself may be written in a number of ways: 

a = (an)$^i = (a„) = (ai, 02, as, • • • )• 

The element a„ in the range of a sequence is the n^^ term of the sequence. 

It is important to distinguish in one's thinking between a sequence and its range, which is a 
countable set. Think of a sequence {xi,X2,X2, . . . ) as an ordered set: there is a first element xi, 
and a second element X2, and so on. The range {xi, X2, X3, . . . } is just a set. There is no "first" 
element. For example, the sequences (1, 2, 3, 4, 5, 6, ... ) and (2, 1, 4, 3, 6, 5, ... ) are different whereas 
the sets {1, 2, 3, 4, 5, 6, ... } and {2, 1, 4, 3, 6, 5, ... } are exactly the same (both are N). The sequence 
(5, 5,5,5,...) has infinitely many terms; the set {5, 5,5,5,...} has exactly one member. 

Nevertheless, some terminology is carried over from the range of a sequence to the sequence 
itself. In particular, a sequence of real numbers is BOUNDED ABOVE if its range is; it is BOUNDED 
BELOW if its range is; and it is BOUNDED if its range is. 
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7.1.2. Remark. Occasionally it is convenient to alter the preceding definition a bit to allow the 
domain of a sequence to be some other set such as, for example, N U {0} the set of all positive 
integers. 

Studying convergence (that is, the "long term" behavior) of sequences and nets is one of the 
most important topics in analysis. The words "eventually" and "frequently" are introduced to 
reduce the number of quantifiers which appear in discussions of convergence. 

7.1.3. Definition. Let (xn) be a sequence of elements of a set S and P be some property that 
members of S may possess. We say that the sequence (xn) eventually has property P if there 
exists no € N such that x„ has property P for every n > uq. (Another way to say the same 
thing: x„ has property P for all but finitely many n.) For example, if j4 is a subset of S, then 
the sequence (x„) is eventually in A if there is an index no such that Xn & A whenever n > no. 
Sometimes instead of saying that a sequence (x„) eventually has property P we say that (x„) has 
property P for sufficiently large n or, more briefly, that (xn) has property P for large n. 

7.1.4. Example. Let 5 be a set, (An) be a sequence of subsets of S, and c an element of S. Now 
{An) is a sequence in the set *P(S) and the property in question is containing the point c. So we may 
say that the sequence (An) eventually contains c (or, equivalently, that the point c is eventually 
IN {An); or that c G An for all but finitely many n) if there exists A; S N such that for all n > k we 
have c £ An- 

7.1.5. Example. The sequence (n^^) of real numbers is eventually less than 0.001. 

7.1.6. Definition. Let {xn) be a sequence of elements of a set S and P be some property that 
members of S may possess. We say that the sequence (x„,) frequently has property P if for every 
k £N there exists n > k such that x„ has property P. (An equivalent formulation: Xn has property 
P for infinitely many n.) For example, if ^ C S*, we say that the sequence (x„) is frequently in 
the set A if for every A; G N there exists n > k such that Xn S A. 

7.1.7. Example. Let 5 be a set, {An) be a sequence of subsets of S, and c an element of S. Now 
{An) is a sequence in the set ^{S) and the property in question is containing the point c. So we may 
say that the sequence {An) frequently contains c (or equivalently, that the point c is frequently 
IN {An); or c G An for infinitely many n) if for every /c G M there exists n ^ k such that c G An • 

7.2. Indexed Families of Sets 

In section 1.2 we discussed families of sets. We now introduce indexed families of sets. The 
distinction between families and indexed families is very similar to the distinction between countable 
sets and sequences. 

7.2.1. Definition. Let A and S be sets. An indexed family of elements of 5 is a function / from 
A into S. We should be quite clear about this: indexed family is just another term for function. 
(See the first paragraph of this chapter.) The domain A of the indexed family / is referred to as 
the index set of /. When thinking of the function / as an indexed family it is conventional to write 
its value at X € A as f\ rather than /(A). The indexed family itself may be written in a number of 
ways: 

(Compare this with 7.1.1.) The range of / is, of course, a subset {fx'- A G A} of S, just as the 
range of a sequence is a countable set. 

7.2.2. Example. Suppose in definition 7.2.1 that 5 is the family ^(T) of all subsets of a nonempty 
set r. Then a function A: A ^ ^{T) would be an indexed family of subsets of T and, thought of 
that way, would usually be written as (^a);^^^- 
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Just as we use different notations to distinguish a sequence (a^) from its range the countable 
set {an - n S N}, we should be careful to distinguish between an indexed family {Ax) ^^j^^ of sets 
and its range the family {^a^ A G A}. Not everyone does this. One argument for not doing so is 
linguistic: {^a : X G A} is a family of sets and each set Ax has an index X, so why shouldn't I call 
{Ax: A G A} an indexed family? 

You may in your reading run across the hybrid notation {^a};^^^- There are at least two 
sources for this. One is a failure to make the distinction we have just been discussing, in which case 
{Ax) ^^j^, {^x}x<^A^ ^^'^ {^a: a G A} are likely to be used interchangeably. Another is the use of 
this notation to denote multisets in some current work. (A multiset is much like a set but members 
may have multiple occurrences. For example, the prime factorization of 360 is 2 • 2 • 2 • 3 • 3 • 5. As 
a set {2,2,2,3,3,5} has cardinality 3 while as a multiset it is regarded as having cardinality 6.) 
Multisets will not be discussed in these notes. And the notation {^a};^^^ will not be used. 

7.2.3. Definition. RecaU from definition 1.4.4 that a family 21 of sets is (pairwise) disjoint if 
^4 n -B = whenever A and B are distinct members of 21. An indexed family (^a);^^^ of sets is 
pairwise disjoint if ^a n j4^ = whenever A and ^ are distinct members of A. 

7.2.4. Example. Suppose J = {1,2,3}, Ai = A2 = [0,1] and A3 = [2,3]. Then {Aj: j G J} is 
pairwise disjoint while {Aj)_.^j is not. 

7.2.5. Exercise. Let A = N and Ax = M for every A G A. Explain carefully why the family 
{^a: a G A} of sets is pairwise disjoint while the indexed family {Ax) ^^^^ is not. 

There are many situations where the indexed family notation is both appropriate and useful. 
If associated with each set in some family of sets there is a real number, then (Sa)^^^ may well be 
the most natural notation for the family. If, on the other hand, an index set plays no role whatever 
in a theorem or its proof, it is usually simpler to denote a family of sets by a single letter. You 
need not do as many authors do and compulsively index everything in sight! 

7.3. Limit Inferior and Limit Superior (for Sets) 

7.3.1. Definition. If (vl„)^-^ is a sequence of sets, then the limit inferior (or lower limit) 
of the sequence is defined by 

00 00 

liminf A„ := I J Pi Afc . 

n=l k=n 

Similarly, the limit superior (or upper limit) of the sequence is defined by 

00 00 

limsupyln := n N • 

n=l k=n 

If the sets liminf and limsupj4„ are equal, then we say that the limit of the sequence {An) 
exists and is equal to this set. If L = liminf An = limsup74„, then we write 

lim An = L 

or 

An — )• L as n — )• 00 . 

Most people say (and write) "liminf" for "limit inferior" and "lim sup" (pronounced lim-soup) for 
"limit superior". Usually we write liminfyl^ for liminf lim sup A„ for lim sup and limA„ 
for hm An. "^"^ 



Many students find the preceding definitions uninviting. The next two propositions give more 
appealing characterizations of the liminf and lim sup of a sequence of sets. Nothing is lost if you 
choose to regard these propositions (once proved) as the definitions of liminf and lim sup. 
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7.3.2. Proposition. // (An) is a sequence of sets, then an element x belongs to liminf ^„ if and 

only if X is eventually in the sequence (An). 

7.3.3. Proposition. If (An) is a sequence of sets, then an element x belongs to limsupA„ if and 
only if X is frequently in the sequence {An). 

7.3.4. Proposition. If (An) is a sequence if sets, then U^^^flfcln^fc ^ fl^iUfcln^fe- 

7.3.5. Exercise. For each n S N let An be the interval (— ^, l] whenever n is odd and the interval 
(—1, ^] whenever n is even. Find liminf and lim sup An' 

7.3.6. Exercise. For each n G N, let An be the open ball in M? of radius 1 centered at the point 
^ ' , ) . Find lim inf An and lim sup An ■ 



n ^ 

7.3.7. Exercise. Let On = ^ ^, In = {an - [a™], On+i - [««.]), and An = ((0, 1] \ In) U (l + I„) 

k=l 

for all n gN. (Here [x] denotes the greatest integer that is less than or equal to x.) Find lim inf A„ 
and lim sup A„. 

7.3.8. Proposition. If {An) is a sequence of sets, then 

lim inf An'^ = (lim sup An)'^ 

and 

lim sup ^n"^ = (liminf . 

7.3.9. Definition. A sequence {Ak) of sets is increasing if Ak ^ Ak+i for each k. It is decreas- 
ing if Ak 5 Ak+i for each k. It is monotone if it is either increasing or decreasing. 

7.3.10. Proposition. If {A^) is a monotone sequence of sets, then lim A^. exists. 

oo oo 



7.3.11. Exercise. Explain why lim sup is an appropriate name for || I J Aj^. Also, explain 
why lim inf A^ is an appropriate name for |^ A^. 



71=1 k= 
n=l k=n 



In the following proposition A Ak denotes lim A Ak where A Ak = Ai A A2 A ■ ■ ■ A An. 

k=i "^<»fc=i k=i 



7.3.12. Proposition. Suppose that (Ak) is a sequence of sets. Then A Ak exists if and only if 

fc— >oo ■ 



hmfc_,^ A' - ^==1 



7.4. Limit Inferior and Limit Superior (for Real Numbers) 

In this section we introduce "lim inf" and "lim sup" for sequences of real numbers. Notice how 
closely the definitions and some of the results here parallel those in the preceding section. 

7.4.1. Convention. In section 5.2 we introduced the notation /\A for the infimum of a subset 
A of a partially ordered set. In order for this infimum to exist it is necessary that A be bounded 
below. If ^ is a subset of M and is not bounded below, we will write /\A = —00 as a notational 

convention. In, particular, if {a„} is a denumerable set of real numbers which is not bounded below 

00 

we write /\ o„ = — 00 or /\ a„ = — 00 or just /\ a„ = —00. Similarly, if ^ is a subset of M which 

n=l n 

is not bounded above we will write \J A = 00. In, particular, if {a„} is a denumerable set of real 

00 

numbers which is not bounded above we write V a„ = 00 or \/ a„ = 00 or \/ a„ = 00. 

n=l n 
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7.4.2. Definition. If (an)^i is a sequence of real numbers, then the LIMIT INFERIOR (or LOWER 

limit) of the sequence is defined by 



Uminf On := V A Ofe • 
n— >-oo y I \ 

n=l k=n 

Similarly, the limit superior (or upper limit) of the sequence is defined by 

oo oo 

limsupa„ := A \J ak ■ 

n=l k=n 

If lim inf a„ and lim sup are equal, then we say that the limit of the sequence exists and is equal 
to their common value. If Z = lim inf a„ = lim sup a„, then we write 

lim an = I 

n-+oo 

or 

an I as n — >■ oo 

or if no confusion seems likely just 

a „ I . 

Whenever there exists a real number / such that an ^ I we say that the sequence (a„) converges. 
Otherwise it DIVERGES. 

As with sets we usually write lim inf a„ for lim inf a„, lim sup for lim sup a„, and lima„ for 
lim an. 

n->oo 

7.4.3. Convention. When one says that the suprcmum (or infimum) of a set of real numbers exists 
or that the limit of a sequence of real numbers exists one always means exists as a real number. 
Thus even though it is correct to write /\Z = — oo, we still say that the infimum of the set of 
integers does not exist. Similarly, while it is correct to write lim„_).oo = oo, we nevertheless say 
that the limit of the sequence (ra^)^^ does not exist. 

There are alternative characterizations of the lim inf and lim sup of a sequence of real numbers. 

7.4.4. Proposition. // (a,„) is a sequence of real numbers, then a real number I is the lim inf a„ if 
and only if for every e > the sequence {an) is eventually greater than I — e and is frequently less 
than I + e. Furthermore, lim inf a„ = — oo if and only if the sequence (a^) is not bounded below. 

7.4.5. Proposition. If {on) is a sequence of real numbers, then a real number I is the lim sup if 
and only if for every e > the sequence (an) is eventually less than I + e and is frequently greater 
than I — e. Furthermore, lim sup a„ = oo if and only if (an) is not bounded above. 

7.4.6. Proposition. // (a„) is a bounded sequence of real numbers then lim inf a„ < lim sup a„. 

7.4.7. Convention. Notice that in the preceding exercise the hypothesis that the sequence be 
bounded is not really necessary — provided that we extend the partial ordering on the real numbers 
to the EXTENDED REAL NUMBERS M := M U {— oo,oo} by agreeing that — oo < oo and that both 
— oo < a and a < oo hold for every real number a. It is conventional also to extend the algebraic 
operations on M to these new objects — oo and oo. Addition is extended the way you would expect: 
it is assumed to be commutative and if a is a real number, then a + oo = oo and a + (— oo) = — oo. 
(The expression oo + (— oo) is left undefined.) If a is a real number and a > 0, then we take 
multiplication to be commutative and associative and set 

a • oo = oo • oo = (— oo) • (— oo) = (—a) • (— oo) = oo . 

Multiplication of the new objects by negative numbers can be taken care of by agreeing that 
(— 1) • oo = — oo and (—1) • ( — oo) = oo. The only surprising (and, if we are not careful, treacherous) 
convention is setting • oo = 0. The rationale for this is, speaking very roughly, that in measure 
theory we wish the "area" of a line in the plane (considered as an "infinitely long rectangle of zero 
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width" ) to be zero. It should be emphasized that these conventions do not turn the extended reals 
into any sort of interesting algebraic object. Addition on M is not even a binary operation (since 
DO + (—00) is not defined). 

The liminf and limsup functions (from sequences of real numbers to the extended reals) are 
monotone. We make this precise in the next proposition. 

7.4.8. Proposition. Let (an) and (6„) be sequences of real numbers. If eventually an < bn, then 
lim inf a„ < lim inf 6„ and lim sup a„ < lim sup 6„ . 

7.4.9. Remark. It is worthwhile to take time out to notice the role played in stating the preceding 
proposition by the conventions adopted in 7.4.7 concerning the symbols —00 and 00. With the 
conventions in place it is completely irrelevant whether either sequence (a„) or (6„) is bounded. 
Were it not for these conventions, communicating the same information would require something 
like the following. 

7.4.10. Proposition. Let [an) and {bn) be sequences of real numbers such that eventually an < bn- 

(a) // [an) is bounded below then so is {bn) and liminf (x^ ^ liminf 6^. 

(b) // (bn) is bounded above then so is (on) and lim sup a„ < limsup 

It is a matter of individual taste whether or not the economy achieved by adopting these 
conventions for algebraically challenged "numbers" like —00 and 00 is worth the price. (After all, 
M is a field; M is (algebraically) not much of anything.) 

7.4.11. Proposition. i/(a„) is a bounded sequence of real numbers, then limsup(— a„) = — liminf a. 
and lim inf (—a„) = — limsupa„. 

Hint for proof . Use proposition 6.1.8. 

7.4.12. Exercise. We can say slightly more: exercise 7.4.11 remains true even if the word "bounded" 
is deleted. Why? How can we communicate this extra information if we choose not to make use of 
the conventions in 7.4.7? 

7.4.13. Proposition. // (a„) and {bn) are bounded sequences of real numbers, then 

lim inf a„ + lim inf 6„ < lim inf (a„ + &«) < lim sup(a„ + bn) < lim sup a„ + lim sup bn ■ 

7 AAA. Example. In the preceding proposition each of the inequalities may be strict. (We say 
that inequalities of the form x < y and x > y, which fail to be reflexive, are strict inequalities. We 
distinguish these from weak inequalities of the form x < y and x > y.) 

7.4.15. Example. Show by example that proposition 7.4.13 is not true if the word "bounded" is 
removed from the hypothesis. Can you think of a way to fix up the statement of the result so that 
it holds for (at least some) not necessarily bounded sequences? 

7.4.16. Proposition. // (a„) and {bn) are sequences of real numbers such that a,j — )■ / G M and 

bn ^ rn gM as n ^ 00, then 

lim(an + bn) I + m as n ^ 00. 

7.4.17. Proposition. If (on) and {bn) are bounded sequences of positive real numbers, then 
(liminf a„) (lim inf 6„) < liminf(a„6„) < limsup(a„6„) < (lim sup a„) (lim sup 6„) . 

7.4.18. Example. In the preceding proposition each of the inequalities may be strict. 

7.4.19. Example. Show that proposition 7.4.17 is not true if the word "bounded" is removed 
from the hypothesis. Can you think of a way to fix up the statement of the result so that it holds 
for (at least some) not necessarily bounded sequences? 
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7.4.20. Proposition. If [an) and are sequences of positive real numbers such that a„ — )• / G M 
and bn ^ m & M as n ^ oo, then 

lim(an6„) — )■ Im as n ^ oo. 

7.4.21. Definition. If a is a real number and e > 0, then the open interval (a — e,a + e) is the 

e-NEiGHBORHOOD of the point o. 

7.4.22. Proposition. If (a„) is a sequence of real numbers and c G M, then lima^ = c if and only 
if the sequence (an) is eventually in every e -neighborhood of c. 

7.4.23. Example. If c> in M, then c^/" 1 as n — )• oo. 

Hint for proof . Treat the cases c > 1 and < c < 1 separately. 

7.4.24. Proposition. //a„, > for every n G N, then 

lim inf ^lil < lim inf ^/a^ < lim sup ^/a^ < lim sup — 5ii . 

Hint for proof . For the first inequality let p = liminf a„-|-i/a„. If p > choose a between 
and p. Argue that for some natural number no we have Uk+i/ak > a whenever k > uq. Let n > uq. 
Show that an/ ana > a"""''. Use example 7.4.23 to show that lim inf ^ya^ > a. 

7.4.25. Example. Each of the inequalities in the preceding exercise may be strict. 

7.4.26. Exercise. Recall (or look up) statements of the ratio test and the root test from beginning 
calculus. Explain carefully what is meant by, and justify, the assertion that the root test is stronger 
than the ratio test. 

7.4.27. Example. If a„ = ^ , then a„ — )• e as n — )• oo. 

Hint for proof . You may assume without proof that e = lim(l + ^) . 

7.4.28. Definition. A sequence (ofc) of real numbers is increasing if < Ofe+i for each k. It is 
DECREASING if > afe+i for each k. It is monotone if it is either increasing or decreasing. 

7.4.29. Proposition. Every bounded monotone sequence of real numbers converges. 



7.4.30. Exercise. Explain why lim sup is an appropriate name for A \J a^. Also, explain 

oo oo 1 

V» n=l k=n 

l\ ak. 

n=l k=n 

7.4.31. Example. The sequence ^\/2, \/2 + \/2, y'2~+\/2^+^, . . . ^ converges. 

7.4.32. Example. Let xi = —10 and Xn+i = 1 — i/l ~ for every n > 1. Then (xn) converges 
and so does lim 



n— s>oo Xr 



In the next proposition the expressions limsupX/i > lim inf , and limx^ are to be taken 

/ift -^n 

pointwise. If 5 is a real valued function on S and (/„) is a sequence of real valued functions on S, 
we write lim sup /„ = 5 if limsup/„(a;) = g(x) for every x £ S. 

7.4.33. Proposition. Let (An) be a sequence of subsets of a nonempty set S. Show that 

(a) limsupx^^ =Xu^,,p^„ 

(b) lim inf =Xu^infA„. 

(c) limj4„ exists if and only i/limx . exists. If these limits exist then 

lim Y . = y,. . 
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7.5. Subsequences and Cluster Points 

7.5.1. Definition. Let (an) be a sequence of members of a set S. If {nk)^=i is a strictly increasing 
sequence in N, then the sequence 

(Ortfc ) = (fflrife )k=l ~ ('^"1 ' '^"■2 > '^"s ) • • • ) 

is a SUBSEQUENCE of the sequence (on). 

7.5.2. Example. If = and bk = 4"^^ for all /c G N, then (6^) is a subsequence of the 
sequence (a^). Intuitively, this is clear, since the second sequence (5, , 13, • • • ) just picks out the 
even- numbered terms of the first sequence ^, ^, j^, . . .). This "picking out" is implemented by 
the strictly increasing function n{k) = 2k (for k gN). Thus b = aon since 

a„, = a{n{k)) = a{2k) = 2"^^ = 4-^ = 5^ 

for all in N. 

7.5.3. Definition. Let (a„) be a sequence of real numbers. A point c G M is a cluster point of 
the sequence (0^) if the sequence is frequently in every e-neighborhood of c. 

There are two observations which indicate how useful cluster points can be. One is that cluster 
points of sequences of real numbers are just the limits of subsequences. The other is that the liminf 
of a sequence is the smallest cluster point of that sequence and the limsup is its largest. This is 
the content of the next two proposition. 

7.5.4. Proposition. // (a„) is a sequence of real numbers and c £ R, then c is a cluster point of 
the sequence if and only if there is a subsequence of {an) which converges to c. 

7.5.5. Example. The sequence + cos + sin ^)'^_^ has four cluster points in M^. 

7.5.6. Proposition. Let (an) be a bounded sequence of real numbers and C be the set of cluster 
points of (an)- Then liminf a„ G C, limsupa„ G C, and for every c £ C 

lim inf a„ < c < lim sup a„ . 

Many authors choose to extend the notion of cluster points to M. Thus when a sequence is not 
bounded above we may regard 00 as a cluster point of the sequence; and when it is not bounded 
below we take —00 to be a cluster point. The convenience of adopting this convention is that then 
the hypothesis that (a„) be bounded may be removed from proposition 7.5.6. 

7.5.7. Proposition. Every bounded sequence of real numbers has a convergent subsequence. 
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CATEGORIES 

In mathematics we study things (objects) and certain mappings between them (morphisms). 
To mention just a few, sets and functions, groups and homomorphisms, topological spaces and 
continuous maps, vector spaces and linear transformations, and Hilbert spaces and bounded linear 
maps. These examples come from different branches of mathematics — set theory, group theory, 
topology, linear algebra, and functional analysis, respectively. But these different areas have many 
things in common: products, coproducts, quotients, and the like. Category theory is an attempt 
to formalize some of these common constructions. Very loosely speaking, category theory is the 
study of what different branches of mathematics have in common. 

In these notes just as we use the language of sets without pursuing the study of set theory, 
we will cheerfully use the language of categories without first embarking on a preliminary study of 
category theory (which itself is a large and important area of research). It can sometimes be a bit 
challenging to learn even the language of categories because many of the introductory treatments 
lean heavily on one's algebraic background. For a very gentle entree to the world of categories try 
Chapter 3 of Semadeni's beautiful book [ ] on Banach Spaces of Continuous Functions. Another 
standard text is the book Categories for the Working Mathematician by one of the founders of the 
subject, Saunders Mac Lane. 

By pointing to unifying principles the language of categories often provides striking insight into 
"the way things work" in mathematics. Equally importantly, one gains in efficiency by not having 
to perform essentially the same construction over and over in slightly different contexts. 

8.1. Objects and Morphisms 

8.1.1. Definition. Let 21 be a class, whose members we call objects. For every pair {S,T) of 
objects we associate a set 'lOlot{S, T), whose members we call morphisms from S to T. We assume 
that mox{S, T) and Wtor(C/, V) are disjoint unless S* = [/ and T = F. 

We suppose further that there is an operation o (called composition) that associates with 
every a G S!Jtor(S', T) and every /3 G 5[Ror(T, U) a morphism /3 o a g 9Hor(5', U) in such a way that: 

(1) 7 o (/3 o a) = (7 o ^) o a whenever a G 93tor(5, T), (5 G mox{T, U), and 7 G 9Jtor(f7, V)] 

(2) for every object S there is a morphism Is G 9Jtor(S', S) satisfying a o Ig = a whenever 
a G mox{S, T) and Is 0/3 = 13 whenever (3 G Tlot{R, S). 

Under these circumstances the class 21, together with the associated families of morphisms, is a 
CATEGORY. 

We will reserve the notation S — T for a situation in which S and T are objects in some 
category and a is a morphism belonging to 9Jtor(S', T). As is the case with groups and vector spaces 
we usually omit the composition symbol o and write l3a for (3 o a. 

8.1.2. Example. The category SET has sets for objects and functions (maps) as morphisms. 

8.1.3. Example. The category AbGp has Abelian groups for objects and group homomorphisms 
as morphisms. (See proposition 4.3.11.) 

8.1.4. Example. The category VEC has vector spaces for objects and linear transformations as 
morphisms. (See proposition 4.5.6.) 
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8.1.5. Example. The category ALG has algebras for objects and algebra homomorphisms for 
morphisms. (See proposition 4.7.5. 

The preceding examples are examples of concrete categories — that is, categories in which the 
objects are sets (together, usually, with additional structure) and the morphism are functions 
(usually preserving this extra structure) . In these notes the categories of interest to us are concrete 
ones. Here (for those who are curious) is a more formal definition of concrete category. 

8.1.6. Definition. A category 21 together with a function □ which assigns to each object A in 21 
a set OA is a concrete category if the following conditions are satisfied: 

/ 

(i) every morphism A is a function from UA to OB; 

(ii) each identity morphism I a in 21 is the identity function on OA; and 

(iii) composition of morphisms A — ^-^B and B — agrees with composition of the functions 
f:OA^OB and g:OB^ OC. 

If A is an object in a concrete category 21, then 0^4 is the underlying set of A. 

Although it is true that the categories of interest in these notes are concrete categories, it may 
nevertheless be interesting to see an example of a category that is not concrete. 

8.1.7. Example. Let G be a monoid. Consider a category C having exactly one object, which 
we call -k. Since there is only one object there is only one family of morphisms 5[Ror(*,*), which 
we take to be G. Composition of morphisms is defined to be the monoid multiplication. That is, 
a o b := ab for all a, b £ G. Clearly composition is associative and the identity element of G is the 
identity morphism. So C is a category. 

8.1.8. Definition. In any concrete category we will call an injective morphism a monomorphism 
and a surjective morphism an EPIMORPHISM. 

8.1.9. CAUTION. The definitions above reflect the original Bourbaki use of the term and are 
the ones most commonly adopted by mathematicians outside of category theory where "monomor- 
phism" means "left cancellable" and "epimorphism" means "right cancellable". (Notice that the 
terms injective and surjective may not make sense when applied to morphisms in a category that 
is not concrete.) 

A morphism B — ^ C is left cancellable if whenever morphisms A — B and A — B 
satisfy gfi = gf2, then /i = /2- Mac Lane suggested calling left cancellable morphisms monic 
morphisms. The distinction between monic morphisms and monomorphisms turns out to be slight. 
In these notes almost all of the morphisms we encounter are monic if and only if they are monomor- 
phisms. As an easy exercise prove that any injective morphism in a (concrete) category is monic. 
The converse sometimes fails. 

In the same vein Mac Lane suggested calling a right cancellable morphism (that is, a morphism 

A — ^-^B such that whenever morphisms B — ^^^C and B — satisfy gif = (72/, then gi = g2) an 
epic morphism. Again it is an easy exercise to show that in a (concrete) category any epimorphism 
is epic. The converse, however, fails in some rather common categories. 

8.1.10. Definition. The terminology for inverses of morphisms in categories is essentially the 

same as for functions (see section 2.10). Let S — '^^T and T — be morphisms in a category. If 
13 o a = Is, then /3 is a left inverse of a and, equivalently, a is a right inverse of /3. We say 

that the morphism a is an ISOMORPHISM (or is invertible) if there exists a morphism T — S 
which is both a left and a right inverse for a. Such a function is denoted by and is called the 
INVERSE of a. 

In any concrete category one can inquire whether every bijective morphism (that is, every map 
which is both a monomorphism and an epimorphism) is an isomorphism. The answer is often 
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a trivial yes (as in propositions 4.3.4, 4.5.11, 4.7.3, and the next two examples) or a trivial no 
(for example, in the category POSET of partially ordered sets and order preserving maps, see 
exercise 5.2.11). But on occasion the answer turns out to be a fascinating and deep result (see 
section 29.4). 

8.1.11. Example. In the category SET every bijective morphism is an isomorphism. 

8.1.12. Example. The category LAT has lattices as objects and lattice homomorphisms as mor- 
phisms. In this category bijective morphisms are isomorphisms. 

8.1.13. Example. If in the category C of example 8.1.7 the monoid G is a group then every 
morphism in C is an isomorphism. 

8.2. Quotients 

Proposition 2.11.7 illustrates an important class of categorical constructions — quotients. The 
following definition evinces a very general attitude towards quotients. It specifies what quotients 
do — not what they are. This definition, like many very abstract definitions, may at first seem 
rather unmotivated and devoid of intuitive content. Throughout these notes you will encounter 
many quotient constructions. My advice is to learn the definition first, and then check it against 
the examples as they come along. A comfortable understanding of the idea of quotient will come 
at the end of this process. Don't demand it up front. For those with serious allergies to abstract 
notions an alternative strategy is to file the definition away until you have seen enough examples 
that you feel you are ready for the general notion. 

8.2.1. Definition. Let A be an object in a category C. An epimorphism A — ^ B in C is a 
QUOTIENT MAP for yl if a map g: B ^ C (in SET) is a morphism (in C) whenever g o vr is a 
morphism. An object S in C is a quotient object for A if it is the range of some quotient map 
for A. 

8.2.2. Example. In the category SET every epimorphism is a quotient map. 

8.2.3. Example. In the category AbGp every epimorphism is a quotient map. 

In the category SET there is a standard quotient object for a set S created whenever an 
equivalence relation is specified on S. That is the point of proposition 2.11.7. Similarly, in the 
category AbGp a standard quotient object for a group G is created whenever a subgroup of G is 
specified. Here is how the construction goes. 

8.2.4. Definition. Let H he a, subgroup of an Abelian group G. Define an equivalence relation ~ 
on G by 

a; ~ y if and only if y — x £ H. 
For each x S G let [x] be the equivalence class containing x. Let G/H he. the set of all equivalence 
classes of elements of G. For [x] and [y\ in G/H define 

[x] + [y] := [x + y]. 

Under this operation G/H becomes an Abelian group. It is the quotient group (or factor 
group) of G by H. The notation G/H is usually read "G mod H'\ The linear map 

tt: G^G/H: x^ [x] 

is called the quotient map. 

8.2.5. Exercise. Several things in the preceding definition require proof. Show that: 

(a) ~ is an equivalence relation. 

(b) Addition of equivalence classes is well defined. 

(c) G/H is an Abelian group. 

(d) The map vr is a quotient map (in the sense of definition 8.2.1). 
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(e) The space G/H is a quotient object. 

The most important theorem about quotients in the category AbGp is frequently called the 

fundamental homomorphism theorem or the first isomorphism theorem. 

8.2.6. Theorem (Fundamental quotient theorem for AbGp). Let G and J be Ahelian groups and 
H he a subgroup of G. If f is a homomorphism from G to J and ker/ 5 H, then there exists a 
unique homomorphism f : G/H ^ J which makes the following diagram commute. 

G 

vr 

G/H ^ J 

/ 

Furthermore, f is injective if and only if ker / = H ; and f is surjective if and only if f is. 

Everything works in vector spaces almost exactly as it does in Abelian groups. There are just 
a few more details. 

8.2.7. Example. Prove that in the category VEC every epimorphism is a quotient map. 

8.2.8. Definition. Let M be a subspace of a vector space V. Define an equivalence relation ~ on 
Vhy 

X ~ y if and only if y — x € M. 

For each x £ V let [x] be the equivalence class containing x. Let V/M be the set of all equivalence 
classes of elements of V. For [x] and [y] in V/M define 

[x] + [y] := [x + y] 

and for a € M and [x] £ V/M define 

a[x] := [ax] . 

Under these operations V/M becomes a vector space. It is the quotient space (or factor space) 
of V by M. The notation V/A4 is usually read "V mod M". The linear map 

TT-.V^ V/M : x^[x] 

is a quotient map. 

8.2.9. Exercise. Prove the facts necessary for the preceding definition to make sense. In particular, 
show that: 

(a) ~ is an equivalence relation. 

(b) Addition of equivalence classes is well defined. 

(c) Multiplication of an equivalence class by a scalar is well defined. 

(d) V/M is a vector space. 

(e) The map vr is a quotient map (in the sense of definition 8.2.1). 

(f) The space V/M is a quotient object. 

8.2.10. Exercise. In the definition of a quotient vector space above take 1/ = and M to be the 
z-axis. Describe the resulting equivalence classes and exhibit an isomorphism between V/M and 
M^. Next, take M to be the xy-plane. Describe the resulting equivalence classes and explain why, 
in this case, V/M is isomorphic to M. 

Theorems 2.11.9 and 8.2.6 have an obvious analogue in the category of vector spaces and linear 
maps. 
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8.2.11. Theorem (Fundamental quotient theorem for VEC). Let V and W be vector spaces and 
M be a subspace ofV. If T is a linear map from VtoW and ker T D M , then there exists a unique 
linear map T: V/M — )• W which makes the following diagram commute. 

V 

IT 

V/M 

T 

Furthermore, T is injective if and only if ker T = M; and T is surjective if and only if T is. 

Do not be tempted on the basis of 8.2.2, 8.2.3, and 8.2.7 to conjecture that in every category 
surjective morphisms are automaticahy quotient maps. We wih encounter a simple counterexample 
in example 11.6.6. 

8.2.12. Definition. Let J be a proper ideal in an algebra A. Define an equivalence relation ~ on 
^by 

a ^ b if and only if b — a ^ J. 
For each a G ^ let [a] be the equivalence class containing a. Let A/ J be the set of all equivalence 
classes of elements of A. For [a] and [b] in A/ J define 

[a] + [b]:=[a + b] and [a] [b] := [ab] 

and for a G C and [a] £ A/ J define 

a[a] := [aa] . 

Under these operations A/ J becomes an algebra. It is the quotient algebra of A by J. The 
notation A/ J is usually read "A mod J". The surjective algebra homomorphism 

t:: A ^ A/ J: a t-^ [a] 

is called the quotient map. 

8.2.13. Exercise. Verify the assertions made in the preceding definition. 

8.3. Products 

Products and coproducts, like quotients, are best described in terms of what they do. 

8.3.1. Definition. Let Ai and A2 be objects in a category C. We say that the triple (P, 7ri,7r2), 
where P is an object and vr^ : P — t- (/c = 1, 2) are morphisms, is a product of Ai and A2 if for 
every object B and every pair of morphisms fk.B^Ak{k = l, 2) there exists a unique morphism 
g: B ^ P such that fk = "^k ° 9 for k = 1,2. 

It is conventional to say, "Let P be a product of ... " for, "Let (P, vri, 7r2) be a product of ..." . 
The product of Ai and A2 is often written as Ai x A2 or as 11^=1 2 ^k- 

In a particular category products may or may not exist. It is an interesting and elementary fact 
that whenever they exist they are unique (up to isomorphism) , so that we may unambiguously speak 
of the product of two objects. When we say that a categorical object satisfying some condition(s) is 
unique up to isomorphism we mean, of course, that any two objects satisfying the condition(s) must 
be isomorphic. We will often use the phrase "essentially unique" for "unique up to isomorphism." 

8.3.2. Proposition. In any category products (if they exist) are essentially unique. 

8.3.3. Example. In the category SET the product of two sets Ai and A2 exists and is in fact the 
usual Cartesian product Ai x A2 together with the usual coordinate projections irk'. Ai x A2 ^ 
Ak - (01,02) H> Ofc. 
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8.3.4. Example. If Vi and V2 are vector spaces we make the Cartesian product Vi x V2 into a 
vector space as follows. Define addition by 

{u, v) + {w, x) := {u + WjV + x) 

and scalar multiplication by 

a{u, v) := {au, av) . 

This makes Vi x V2 into a vector space and that this space together with the usual coordinate 
projections vr/j : Vi x V2 — >• : {vi,V2) t-^ Vk {k = 1,2) is a product in the category VEC. It is 
usually called the direct sum of V and W and is denoted by ^ © 1^. 

It often possible and desirable to take the product of an arbitrary family of objects in a category. 
Following is a generalization of definition 8.3.1. 

8.3.5. Definition. Let {^\) x^a indexed family of objects in a category C. We say that the 
object P together with an indexed family ('''"a);^^^ of morphisms ttx: P ^ Ax is a product of the 
objects Ax if for every object B and every indexed family {fxjx^A °^ morphisms fx'- B ^ Ax there 
exists a unique map g: B ^ P such that f\ = t^\o g for every A € A. 

A category in which arbitrary products exist is said to be product complete. Many of the 
categories we encounter in these notes are product complete. 

8.3.6. Definition. Let ('S'a);^^;^^ be an indexed family of sets. The Cartesian product of the 
indexed family, denoted by OasA J^^^ H 'S'a, is the set of all functions / : A — |J S'a such that 
/(A) G 5a for each A G A. The maps tta : H •S'a : / 1-^ /(A) are the canonical coordinate 
PROJECTIONS. In many cases the notation fx is more convenient than /(A). (See, for example, 
example 8.3.8 below.) 

8.3.7. Example. A very important special case of the preceding definition occurs when all of the 
sets Sx are identical: say ^a = ^ for every A E A. In this case the Cartesian product comprises all 
the functions which map A into A. That is, HasA -^a = ( see exercise 3.2.18). Notice also that 
in this case the coordinate projections are evaluation maps. For each A the coordinate projection 
TTA takes each point / in the product (that is, each function / from A into A) to /(A) its value at A. 
Briefly, each coordinate projection is an evaluation map at some point. 

8.3.8. Example. What is M"? It is just the set of all n-tuples of real numbers. That is, it is the 
set of functions from N„ = {1, 2, . . . , n} into M. In other words M" is just shorthand for M^". In 

one usually writes Xj for the j*^ coordinate of a vector x rather than x{j). 

8.3.9. Example. In the category SET the product of an indexed family of sets exists and is in 
fact the Cartesian product of these sets together with the canonical coordinate projections. 

8.3.10. Example. If (^a);,^^ is an indexed family of vector spaces we make the Cartesian product 
W Vx into a vector space as follows. Define addition and scalar multiplication pointwise: for /, 
g ^\[Ax and a G M 

(/ + 5)(A) ■.= f{\) + g{\) 

and 

(a/)(A) :=a/(A). 

This makes Y\ Vx into a vector space, which is sometimes called the direct product of the spaces 
Va, and this space together with the canonical coordinate projections (which are certainly linear 
maps) is a product in the category VEC. 

8.3.11. Exercise. Prove or disprove: 

(a) The Cartesian product of a finite family of countable sets is countable. 

(b) The Cartesian product of a countable family of finite sets is countable. 

Not every category is product complete. 
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8.3.12. Exercise. Show that in the category of all nonempty subsets of M and inclusion maps ( 
Mox{A, B) = <Il a A <^ B) products need not exist. 

8.4. Coproducts 

Coproducts are like products — except all the arrows are reversed. 

8.4.1. Definition. Let Ai and A2 be objects in a category C. The triple {P,ji,j2), [P is an object 
and jk'- Ak ^ P , k = 1,2 are morphisms) is a COPRODUCT of Ai and A2 if for every object B and 
every pair of morphisms gk'- Ak ^ B {k = 1,2) there exists a unique morphism h: P ^ B such 
that gk = ho jj^ for k = 1,2. 

8.4.2. Proposition. In any category coproducts (if they exist) are essentially unique. 

8.4.3. Definition. In definition 1.4.6 we introduced the notation A^B for the union of two disjoint 
sets A and B. We now extend this notation somewhat and allow ourselves to take the disjoint union 
of sets A and B even if they are not disjoint. We "make them disjoint" by identifying the set A 
with the set A' = {{a,l) : a € A} and B with the set B' = {{b,2): b £ B}. Then we denote the 
union of A' and B', which are disjoint, hy A^ B and call it the disjoint union of A and B. 

In general, if (^a);^^^ is an indexed family of sets, its disjoint union, l+J Ax is defined to be 
U{(A,A): AeA}. ^eA 

8.4.4. Example. In the category SET the coproduct of two sets Ai and A2 exists and is their 
disjoint union Ai tt) A2 together with the obvious inclusion maps Lk- Aj- ^ Ai^tl A2 [k = 1, 2). 

It is interesting to observe that while the product and coproduct of a finite collection of objects 
in the category SET are quite different, in the more complex category VEC they turn out to be 
exactly the same thing. 

8.4.5. Example. If Vi and V2 are vector spaces make the Cartesian product Vi XV2 into a vector 
space as in example 8.3.4. This space together with the obvious injections is a coproduct in the 
category VEC. 

It often possible and desirable to take the coproduct of an arbitrary family of objects in a 
category. Following is a generalization of definition 8.4.1. 

8.4.6. Definition. Let (^a);^^^^^ be an indexed family of objects in a category C. We say that the 
object C together with an indexed family {I'x) ^^^^ °f morphisms l\: A\ — t- C is a coproduct of 
the objects Ax if for every object B and every indexed family {fx)x£A morphisms fx'- Ax ^ B 
there exists a unique map g: C ^ B such that fx=g°'-x for every A G A. The usual notation for 
the coproduct of the objects Ax is UasA ^a- 

8.4.7. Example. In the category SET the coproduct of an indexed family of sets exists and is 
the disjoint union of these sets. 

8.4.8. Definition. Let 5 be a set and F be a vector space. The support of a function / : 5 — )• F 
is{sGS: /(s)/0}. 

8.4.9. Example. If (^a);,^^ 

is an indexed family of vector spaces we make the Cartesian product 
into a vector space as in example 8.3.10. The set of functions / belonging to Va which have 
finite support (that is, which are nonzero only finitely often) is clearly a subspace of J^V^. This 
subspace is the direct sum of the spaces Vx. It is denoted by 0;^^^ This space together with 
the obvious injective maps (which are linear) is a coproduct in the category VEC. 
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ORDERED VECTOR SPACES 

9.1. Partially Orderings on Vector Spaces 

9.1.1. Definition. A partial ordering < on a real vector space V is compatible with (or re- 
spects) tiie operations (addition and scalar multiplication) on V if for all x, y, z £ V 

(a) X < y implies x + z < y + z, and 

(b) X < y, a > imply ax < ay. 

A real vector space equipped with a partial ordering which is compatible with the vector space 
operations is an ordered vector space. 

9.1.2. Example. The real line M (with its usual operations and partial ordering) is an ordered 
vector space. 

9.1.3. Exercise. The usual ordering on M cannot be extended to a partial ordering on the complex 
plane C which respects both addition and multiplication. (A partial ordering on a ring respects 
multiplication if ab is positive whenever both a and b are.) 

9.1.4. Example. If S" is a nonempty set the family J-{S) of real valued functions on S is an ordered 
vector space under the pointwise operations defined in example 4.4.7 and the partial ordering defined 
in example 5.1.8. This is the usual ordering on T{S). Explain how this makes M" and s = s{R), 
the set of all sequences of real numbers, into ordered vector spaces. 

Proposition 6.1.9 is a special case of the next result. The proof however may be quite different 
from the one you gave in 6.1.9. 

9.1.5. Proposition. Let A and B be nonempty subsets of an ordered vector space. If A and B 

both have least upper bounds, then so does A + B and 

sup{A + B) = sup A + sup B . 

9.1.6. Proposition. Let A be a nonempty subset of an ordered vector space. If A has a least upper 
bound and a > 0, then aA has a least upper bound and 

sup{aA) = a sup A . 

The result stated for sets of real numbers in exercise 6.1.8 holds in ordered vector spaces. 

9.1.7. Proposition. Let A be a nonempty subset of an ordered vector space. If A has a greatest 
lower bound, then —A has a least upper bound and 

sup(-A) = -inf ^. 

9.1.8. Exercise. Show how to use the preceding exercise to prove that if A and B are nonempty 
subsets of an ordered vector space and if A and B both have greatest lower bounds, then so does 
A + B and 

mf{A + B) = inf A + MB. 

It is a good idea to be alert for uses of special cases of the preceding very general results. For 
example, 

(a + 6) A (a + c) = a + (6 A c) 
is an identity which occurs frequently. It is just exercise 9.1.8 where A = {a} and B = {b,c}. 
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9.2. Convexity 

9.2.1. Definition. Let be a vector space. Recall that a linear combination of a finite set 
{xi, . . . , Xn} of vectors in y is a vector of the form Y12=i (^k^k where ai, . . . , a„ G M. If ai = 02 = 
• • • = a„ = 0, then the linear combination is trivial; if at least one is different from zero, the 
linear combination is nontrivial. A linear combination X]fc=i "^fc^fc °f vectors xi,...,Xn is a 
CONVEX COMBINATION if Ofe > for each A; (1 < A; < n) and if Y^2=i "fc = 1- 

9.2.2. Definition. If a and 6 are vectors in the vector space V, then the closed segment between 
a and b, denoted by [a, b], is {(1 — t)a + tb: < t < 1}. 

9.2.3. CAUTION. Notice that there is a slight conflict between this notation, when applied to 
the vector space M of real numbers, and the notation introduced in section 1.7. In M the closed 
segment [a, b] is the same as the closed interval [a, b] provided that a < b. If a > 6, however, the 
closed segment [a, b] is the same as the segment [b, a] , it contains all numbers c such that b < c < a, 
whereas the closed interval [a, b] is empty. 

9.2.4. Definition. A subset C of a vector space V is convex if the closed segment [a,b] is 
contained in C whenever a, b £ C. 

9.2.5. Example. A disk is a convex subset of M^. The set {{x,y) : 1 < x^ + < 2} is not a convex 
subset of M2. 

9.2.6. Example. Every subspace of a vector space is convex. 

9.2.7. Definition. Let A be a subset of a vector space V. The convex hull of A is the smallest 
convex subset of V which contain A. 

9.2.8. Exercise. Show that the preceding definition makes sense by showing that the intersection 
of a family of convex subsets of a vector space is itself convex. Then show that a "constructive 
characterization" is equivalent; that is, prove that the convex hull of A is the set of all convex 
combinations of elements of A. (See remark 5.3.6.) 

9.3. Positive Cones 

9.3.1. Definition. Let V he a vector space. A subset C of y is a cone in V if aC C C for every 
a > 0. A cone C in F is proper if C n (-C) = {0}. 

9.3.2. Proposition. A cone C in a vector space is convex if and only if C + C C. 

9.3.3. Example. If V is an ordered vector space, then the set 

V+ := {x £V: x>0} 

is a proper convex cone in V. This is the positive cone of V and its members are the positive 
elements of V.) 

9.3.4. Proposition. Let V be a vector space and C be a proper convex cone in V. Define x < y 
if y — X G C. The relation < is a partial ordering on V and is compatible with the vector space 
operations on V. This relation is the partial ordering induced by the cone C. The positive cone 

of the resulting ordered vector space is just C itself. 

9.3.5. Definition. Let V be an ordered vector space. Its positive cone is generating if 
V = — V~^; that is, if every element of V can be written (in at least one way) as the difference 
of positive elements. 

9.3.6. Example. Let C = {(x, y) G : a; > and y > 0}. Then C IS a proper convex cone m M . 
If < is the partial ordering on M? induced by C then {u, v) < {x, y) if and only if tt < x and v <y. 
The cone C is generating. This is the usual ordering on M^. 
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9.3.7. Example. Let C = {{x,y) e M? : x > 0} U {{0,y) e : y > 0}. Then C is a proper convex 
cone in E?. If < is the partial ordering on induced by C then (u, v) < (x, y) if and only if either 
u < X OT else u = x and v < y. The cone C is generating. This is lexicographic ordering 
on E?. (See example 5.1.6.) 

9.3.8. Example. Let C = {(x,0) G M^. ^ > q|_ Tj^g^ q 

is a proper convex cone in M . If < is 
the partial ordering on M? induced by C then (u, v) < (x, y) if and only u < x and v = y. The cone 
C is not generating. 

9.3.9. Remark. It is quite easy to see that if the positive cone in an ordered vector space V 
is not generating, then W = — V~ is a partially ordered vector space contained in V and its 
positive cone, which is V~^, is generating. 

9.3.10. Example. In example 9.3.8 the positive cone C (the positive x-axis) is not generating. 
But the ordered vector space C — C generated by C is clearly the (entire) x-axis in M^. 

9.3.11. Example. Let 5 be a nonempty set and J-^{S) be the set of all real valued functions on S. 
In example 9.1.4 we found that under pointwise operations and partial ordering J-{S) is an ordered 
vector space. In this space the set C = {/ G J~iS) : /(x) > for all x G S"} is a proper convex cone. 
The partial ordering this cone induces on -^(5") is exactly the (pointwise) ordering introduced in 
example 5.1.8. 

9.4. Finitely Additive Set Functions 

9.4.1. Definition. Let 21 be an algebra of subsets of a set S. A real valued function ^ on 21 is a 

REAL FINITELY ADDITIVE SET FUNCTION on 21 if 

fi{A^ B) = n{A) + fi{B) 

whenever A and B are disjoint members of 21. 

A real valued finitely additive set function on an algebra 21 of sets is bounded if there exists a 
number M > such that 

< M 

for all A G 21. We denote by ba(S', 21) the set of all bounded real finitely additive set functions on 
2t. (If no confusion will result we shorten the notation to ba(S').) 

9.4.2. Proposition. If fi is a real finitely additive set function on an algebra of sets, then ^{^) = 0. 

9.4.3. Proposition. If ^ is a real finitely additive set function on an algebra 21 of sets and A, 
B G 21, then 

^l{A UB) + fi{A nB)= fi{A) + n{B) . 

9.4.4. Proposition. If fj, is a real finitely additive set function on an algebra 21 of sets and A, 
S G 21, then 

H{B\A) = fi{B)- n{AnB). 
If, additionally, A Q B, then the conclusion is 

f,iB\A) = fi{B)-fiiA). 

9.4.5. Definition. We say that a real valued function / on any set S is positive if /(s) > for 
all s £ S. In particular, a real finitely additive set function fi on an algebra 21 of sets is positive 
if n{A) > for all ^ G 21. 

The next result says that positive finitely additive set functions are COUNTABLY subadditive. 

9.4.6. Proposition. If fj, is a positive finitely additive set function on an algebra 21 of sets and if 
A\, . . . , An G 21, then 

(n \ n 

k=l ^ k=l 
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9.4.7. Example. Let 5 be a finite set and for every ^ C let fJ-{A) = card A. Then fiis a bounded 
real finitely additive set function on ^{S). 

It is a little hard to think of examples of unbounded finitely additive set functions, but here is 
one. (Another appears as a consequence of the Hahn-Banach theorem in example 29.1.17.) 

9.4.8. Example. Let (3 be the set of all intervals in M of the form (a, 6], (—00,6], or (a, 00). The 
family 21 of all finite unions of members of 6 is an algebra of subsets of M. Define on 6 by setting 
/x((a,6]) = 6 — a if a < 6, //((— oo,a)) = a, and 00)) = —b; then use finite additivity to extend 
/i to all of 21. This makes fi into a real finitely additive set function on 21. It is not bounded. 

9.4.9. Exercise. Give an example of an infinite set S and a bounded real finitely additive set 
function on ^{S). Make your example nontrivial by arranging things so that n{{s}) is different 
from zero for every s £ S. 

9.4.10. Example. Let 21 be an algebra of subsets of a set S. For /x, 1/ G ba(5') and a G M define 

{fi + u){A) = ^{A) + iy{A) and 
{afi){A) = an{A) 

for all A £ '01. Also let < ly if < i^{A) for all A in 21. Under these definitions ba(S') is an 
ordered vector space. 
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TOPOLOGICAL SPACES 

Most of (real) analysis is a study of the interplay of algebra and topology. Thus far in these 
notes there has been quite a bit of algebra, but no topology. Now we begin to remedy this unhappy 
situation. 

The classical part of topology which can be studied without the invocation of elaborate algebraic 
machinery is usually referred to as point-set topology. One excellent reference for this material is 
General topology [ ' ^] by Stephen Willard. A remarkable (and unusual) feature of this text is its 
thoughtfully designed and useful index. Another book I recommend highly is Albert Wilansky's 
Topology for Analysis [47]. The tables in the appendix, although a bit tricky to decipher until 
you get used to them, are invaluable in finding the relationships between an amazing 42 different 
topological properties that spaces may possess. I know of nothing like this in any other text. One of 
the first and best written topology texts is John L. Kelly's General Topology [27]. And, of course, 
there are many, many others — some quite good. 

10.1. Definition of Topology 

10.1.1. Definition. A family 1 of subsets of a set X is a topology on X if 
(i) and X belong to T; 

(h) if 6 C T, then {J eel; and 
(iii) if 6 is a finite subfamily of T, then P| S G T. 

The pair {X, T) is a topological space. As is by now familiar, we will abuse language and say 
things like, "Let X be a topological space." If {X, 1) is a topological space, the members of T are 

o 

OPEN SETS. We often use the notation [/ C X to indicate that U is an open subset of X. 

10.1.2. Remark. It is not unreasonable to regard condition (i) in the preceding definition as 
redundant. It depends entirely on the conventions you choose to make about the union and inter- 
section of an empty family of sets. Recall the discussion of this matter in the cautionary note 1.4.10. 
According to the arguments given there condition (iii) guarantees that the empty set automatically 
belongs to every topology on S and condition (ii) guarantees that S itself belongs to every such 
topology. 

The virtue of looking at things in this fashion is that it streamlines the definition of topology: 
a topology on a set S is just a family of subsets of S which is closed under unions and finite 
intersections. The price one pays for this slight convenience, as was mentioned in 1.4.10, is having 
to distinguish between the empty family 5 of subsets of the set S and the empty family & of subsets 
of a different set T. They are different because P| 5^ = 5 while f] & = T. Most authors avoid dealing 
explicitly with the problem by including condition (i) in their definition of topology. 

If you have not previously encountered topological spaces, the preceding definition may make 
little sense to you at first reading. Don't fret. As was remarked in section 8.2 this kind of abstract 
definition, although easy enough to remember, can be irritatingly difficult to understand. Staring 
at it doesn't help. It appears that a bewildering array of entirely different things might turn out 
to be topologies, even on relatively simple sets. And this is indeed the case. Some turn out to 
be interesting and useful and some do not. An understanding and appreciation of the definition 
come only gradually. You will notice as you advance through this material that many important 
concepts such as continuity, compactness, and connectedness are defined by (or characterized by) 
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open sets. Thus theorems which involve these ideas wih rely on properties of open sets, that is, on 
topology, for their proofs. 

10.1.3. Example. The most rudimentary topology one can place on a nonempty set X is the 
INDISCRETE TOPOLOGY in which the only open sets are the empty set and X itself. (Notice the 
spelling: there is nothing indiscreet about the indiscrete topology.) 

10.1.4. Example. At the opposite extreme is the largest possible topology on X, the discrete 
TOPOLOGY in which every set is open; that is, the topology on X is 

10.1.5. Example. Let X = {0,1} and 1 = {0,{O},X}. The topological space (X,T), which is 
sometimes useful as a counterexample, is called the Sierpinski space. 

10.1.6. Example. Let X be an infinite set and T be the family consisting of and of all sets 
[/ C X such that Lf^ is finite. This is the COFINITE TOPOLOGY on X. 

10.1.7. Definition. A subset of a topological space is closed if its complement is open. 

10.1.8. CAUTION. It is a common mistake to treat subsets of a topological space as if they were 
doors or windows, and to conclude, for example, that a set is closed because it is not open, or that 
it cannot be closed because it is open. These "conclusions" are wrong! When the set {1,2,3} is 
given the indiscrete topology the subset {1,2} is neither open nor closed. When the same set is 
given the discrete topology {1,2} is both open and closed. 

10.1.9. Example. In the Sierpinski space (see example 10.1.5) the closed sets are 0, {1}, and X 
itself. 

10.1.10. Example. If X is an infinite set with the cofinite topology (see example 10.1.6), then 
the closed subsets of X are the finite sets together with X itself. 

10.1.11. Proposition. //2l is a family of closed subsets of a topological space, then is closed. 

10.1.12. Exercise. Let 5 be a subset of a topological space X . What, precisely, do we mean by 
"the smallest closed set containing S"'? (If this isn't obvious, consult remark 5.3.6.) 

10.1.13. Proposition. If ^ is a finite family of closed subsets of a topological space, then |J21 is 
closed. 

10.1.14. Definition. If & and T are topologies on a set S and © C T, then we say that & is 
a smaller (or weaker, or coarser) topology than T. Similarly, T is said to be a larger (or 
stronger, or finer) topology than 6. We weaken a topology if we replace it by a smaller one; 
we STRENGTHEN a topology by replacing it with a larger one. 

10.1.15. Exercise. Let X = {a,b,c,d,e, f, g}. What is the smallest topology on X under which 
the sets {a}, {a,c}, and {e,^} are open and the set {a,c,e,g} is closed? List all of the closed sets 
in the resulting topological space. 

10.1.16. Exercise. Let X = {a,b,c}. How many distinct topologies can X be given? (If you are 
interested in a somewhat more challenging problem let X = {a,b,c,d}. 

The next three examples assume that you are already acquainted with the usual topology on 
the real line. If this is not the case, defer them until you have read example 10.2.6. 

10.1.17. Example. The set Z of integers is a closed subset of M. 

10.1.18. Example. The set {^ : n G N} is not a closed subset of M. 

10.1.19. Example. The set {^ : n G N} U {0} is a closed subset of M. 

10.1.20. Exercise. Exhibit an example to show that the intersection of an arbitrary family of 
open sets in a topological space need not be open. Also give an example to show that an arbitrary 
union of closed sets need not be closed. 
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10.1.21. Example. Any subset S of a topological space X is itself a topological space in a natural 

o 

way. The family of all sets of the form U r\ S where [/ C X is a topology on S. In this case we 
say that 5 is a (topological) SUBSPACE of X. The family {[/ n 5: C/ C X} is the topology 
on S INDUCED BY the topology on X. Subsets of topological spaces are thus ordinarily regarded 
as topological space in their own right without further explanation. 

10.1.22. Example. It is a routine matter to "transfer" a topology from a topological space X to 
a set S by means of a bijection. Suppose that / : — )• X is a bijection. Then the set of all subsets 

o 

of S of the form f^{U) where [/ C X is a topology on S. (Compare this with proposition 4.7.6.) 

10.1.23. Example. It is frequently convenient to regard the set M as a topological space. An 
easy way of doing this is to use the technique in the preceding example. Let / be the arctangent 
function on M and extend it to M by setting /(— oo) = — | and /(oo) = ^. We will observe in 
example 11.1.9 that with this topology on M the set M of ordinary real numbers is a subspace of M. 

10.2. Base for a Topology 

10.2.1. Definition. Let (AT, T) be a topological space. A family 55 C T is a BASE for T if each 
member of T is a union of members of !B. In other words, a family ^ of open sets is a base for T 
if for each open set U there exists a subfamily *B' of !B such that [/ = (J *B'. 

In practice it is often more convenient to specify a base for a topology than to specify the 
topology itself. It is important to realize, however, that there may be many different bases for the 
same topology. Once a particular base has been chosen we refer to its members as basic open sets. 

10.2.2. Example. Let A be a nonempty set and 53 be the collection of all one-point sets in A. 
That is, let *B = {{x}: x e A}. Then 53 is a base for the discrete topology on A. 

10.2.3. Proposition. Let X he a topological space. A necessary and sufficient condition for a 

o 

family 53 of open subsets of X to be a base for the topology on X is that for every {/ C A and every 
X £ U there exists i? G 53 such that x £ B <^ U . 

10.2.4. Corollary. Let X be a set, ^ be a base for a topology T on X , and 53' be a base for a 
topology T' on X. Then T' is stronger than T if and only if for every set S G 53 and every x £ B 
there exists a set B' G !B' such that x £ B' C B. 

Proposition 10.2.3 gives a necessary and sufficient condition for a family of open sets to be 
a base for some particular topology on a set. It is quite natural, however, to ask a somewhat 
more interesting question. Suppose we are given a collection !B of subsets of a set A: under what 
conditions is !B a base for some topology on A? An answer is given in the next proposition. 

10.2.5. Proposition. A family 53 of subsets of a set X is a base for some topology on X if and 
only if 

(i) U ^ = ^ and 

(ii) for every U , V £ ^ and for every x G U Ci V, there exists a set W in ^ such that 

X £W ounv. 

10.2.6. Example. Since the family of all bounded open intervals (a, b) in the real line satisfies 
conditions (i) and (ii) of proposition 10.2.5, it is a base for some topology on the real line M. This 
is the USUAL TOPOLOGY on M. Under this topology a subset of M is open if and only if it is a union 
of a family of bounded open intervals. Thus, happily, open intervals (even ones of infinite length) 
turn out to be open sets, and closed intervals are closed sets. When we say that this is the usual 
topology for the real numbers we mean that unless the contrary is explicitly stated, whenever we 
regard M as a topological space it will be with respect to this topology. 
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10.2.7. Example. If a e Z and d G N then the set {o + nd: n e Z} is an arithmetic PRO- 
GRESSION. (For example, if a = 2 and d = 5, then the corresponding arithmetic progression is 
{...,-8,-3,2,7,12,...}.) 

(a) The family of all arithmetic progressions is a base for a topology on Z. 

(b) Every arithmetic progression is a closed set as well as an open set. 

(c) It is possible and entertaining to give a topological proof that there are infinitely many 
prime numbers. Hint. Consider the set A = [JAp where Ap is the arithmetic progression 

p 

consisting of all integer multiples of a prime number p and the union is taken over the set 
of all prime numbers. 

10.2.8. Proposition. If A is a nonempty closed subset o/M which is bounded above, then supj4 
is a member of A. 

10.3. Some Elementary Topological Properties 

10.3.1. Definition. A topological space is second countable if there exists a countable base 
for its topology. 

10.3.2. Example. The topological space M (with its usual topology) is second countable. Hint. 
Consider bounded open intervals with rational endpoints. Use example 3.2.12 and proposition 3.2.9. 

10.3.3. Example. A discrete topological space is second countable if and only if it is countable. 

10.3.4. Definition. Two nonempty subsets A and i? of a topological space X can be separated 
BY open sets if there exists open subsets U and V oi X such that A O U, B OV, and UnV = ^. 
(And of course we say that two points a and b can be separated by open sets if {a} and {6} can be.) 

10.3.5. Example. In the real line M the sets [—1,0) and (0, 1] can be separated by open sets; the 
sets [—1,0] and (0, 1] can not. 

10.3.6. Definition. A Hausdorff topological space is one in which every pair of distinct points 
can be separated by open sets. 

10.3.7. Definition. If U is an open subset of a topological space X and x £ U we say that C/ is a 

NEIGHBORHOOD of X. 

Thus a topological space X is Hausdorff if for any two distinct points in X we can find disjoint 
neighborhoods. 

10.3.8. Example. Every discrete topological space is Hausdorff. 

10.3.9. Example. No space with the indiscrete topology having at least two points is Hausdorff. 

10.3.10. Example. An infinite set with the cofinite topology is not Hausdorff. 

10.3.11. Example. The real line (with its usual topology)is Hausdorff. 

10.3.12. Example. The Sierpihski space 10.1.5 is not Hausdorff. 

10.3.13. Exercise. Is the space in exercise 10.1.15 Hausdorff? 

10.4. Metric Spaces 

Most of the topological spaces appearing in these notes are metric spaces — that is, spaces whose 
topology is determined by a notion of distance. Although there is a bewildering variety of metric 
spaces, it is also true that they share some simple and pleasant properties, properties so striking 
that it seems natural to regard them collectively as a single class of examples of topological spaces. 
Thus the present section is little more than an extended list of examples of an example. Metric 
spaces turn out to be among the nicest inhabitants of the unruly world of general topological spaces. 
More details on these spaces can be found in my ProblemText [1 . ]. 
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10.4.1. Definition. Let M be a nonempty set. A function (i:MxM^M is a METRIC (or 

DISTANCE function) on M if for all x, y, z e M 

(1) d{x,y) = d{y,x), 

(2) d{x,y) < d{x,z) + d{z,y), (the triangle inequality) 

(3) d{x, x) = 0, and 

(4) d{x, y) = only if x = y. 

If (i is a metric on a set M, then we say that the pair (M, d) is a metric space. The notation 
"d{x,y)" is read, "the distance between x and y". (As usual we substitute the phrase, "Let M be 
a metric space ..." for the correct formulation "Let (M, d) be a metric space . . . " . 

If d: M X M — > M satisfies conditions (l)-(3), but not (4), then d is a PSEUDOMETRIC on M. 

10.4.2. Definition. If A and B arc subsets of a metric space M, then the DISTANCE BETWEEN A 
AND B, denoted by d{A, B), is defined to be inf{d(a, b): a G A and b E B}. 

10.4.3. Proposition. If d is a metric on a set M, then d{x,y) > for all x, y & M. 

10.4.4. Proposition. For all points x, y, and z in a metric space (M, d) 

\d{x,y) - d{x,z)\ < d{y,z) . 

10.4.5. Definition. For each point a in a metric space {M,d) and each number r > we define 
Br{a), the OPEN BALL about a of radius r, to be the set of all those points whose distance from a 
is strictly less than r. That is, 

Br{a) := {x e M: d{x,a) < r} . 
We also define Cr{a), the closed ball about a of radius r, by 

Cr{a) :={x€ M: d{x, a) < r} . 
and Sr{a), the sphere about a of radius r, by 

Sr{a) := {x G M : d{x, a) = r} . 

When r = 1 we have the OPEN UNIT BALL (or closed unit ball or UNIT sphere) about a. When 
the underlying set M is also a vector space we sometimes refer to the unit ball in M, by which we 
mean the open unit ball about in M. 

10.4.6. Example. The absolute value of the difference of two numbers is a metric on M. We will 
call this the usual metric on M. In this metric the open ball about a of radius r is just the open 
interval {a — r,a + r). 

10.4.7. Exercise. Define d{x,y) = |arctanx — arctanyj for all x, y G M. 

(a) Show that d is a metric on M. 

(b) Find d(-l, V3). 

(c) Solve the equation d{x, 0) = d{x, \/3). 

10.4.8. Example. Define d{x, y) = \x'^ — for all x, y eM.. Then cZ is a pseudometric but not a 
metric on M. 

10.4.9. Example. Let M be any nonempty set. For x, y E M define 

1 if X ^ y 



d{x,y) 



if X = y . 



It is easy to see that d is a metric; this is the DISCRETE METRIC on M. Although the discrete 
metric is rather trivial it proves quite useful in constructing counterexamples. 

10.4.10. Proposition. If M is a metric space, a G M, and < r < s, then Br{a) C Bs{a). 
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10.4.11. Example. For points x = (xi, . . . , Xn) and y = {yi, . . . , yn) in let 

d{x,y) := - ykf\ . 

Then d is the usual (or Euclidean) metric on M". The only nontrivial part of the proof that d 
is a metric is the verification of the triangle inequality (that is, condition (2) of the definition): 

d{x,y) < d{x,z) +d{z,y) . 

To accomplish this let x = (xi, . . . , x„), y = (yi, . . . , yn), and z = (zi, . . . , Zn) be points in M". 
Apply Minkowski's inequality (6.5.2) with Uk = Xk — Zk and = Zk — yk ^or 1 < k < n to obtain 

/ " \ I 

= [^{(xk - Zk) + {zk - yk)f\ 

< (X](2;fc - zk)A ' + (X^(^fc - 2/fc)^) ' 

= d{x,z) +d{z,y) . 

The Euclidean metric is by no means the only metric on which is useful. Two more examples 
follow (10.4.12 and 10.4.13). 

10.4.12. Example. For points x = (xi, . . . , x„) and y = {yi, . . . , yn) in M" let 

n 

di{x,y) := - y^l- 

k=l 

It is easy to see that di is a metric on M". When n = 2 this is frequently called the TAXICAB 
metric. (Why?) 

10.4.13. Example. For x = (xi, . . . ,x„) and y = {yi, . . . ,yn) in M" let 

duix, y) := max{|xfc - yfc| : 1 < ^ < n}. 
Then du is a metric on M". The triangle inequality is verified as follows: 

\xk - yk\ < \xk - zk\ + \zk - yk\ 

< max{ \xi — Zi\ : 1 < i < n} + max{ \zi — yi\ : 1 < i < n} 
= du{x,z) +du{z,y) 
whenever 1 < A; < n. Thus 

du{x,y) = max{|xfc - yk\-'^<k<n] 
< du{x,z) +du{z,y) . 

The metric du is called the uniform metric on M". The reason for this name will become clear 
later (see the comment after the verification of example 10.4.32). 

Notice that on the real line the three immediately preceding metrics agree; the distance between 
points is just the absolute value of their difference. That is, when n = 1 the metrics given in 
examples 10.4.11, 10.4.12, and 10.4.13 reduce to the one given in example 10.4.6. 

10.4.14. Remark. A crucial observation is that every metric space is a topological space. More 
precisely, we use proposition 10.2.5 to show that the family of open balls in a metric space M is 
a base for a topology on M. Unless the contrary is explicitly stated we will always regard metric 
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spaces as being topological spaces under this topology, which we refer to as the topology induced by 
the metric. Since condition (i) of proposition 10.2.5 obviously holds for the family of all open balls 
in M, we need only verify condition (ii). 

10.4.15. Proposition. Let a and b be points in a metric space M and r, s > 0. If c belongs to 
Br (a) n Bs{b), then there exists a number t > such that Bt{c) C Br (a) Pi Bs(b). 

10.4.16. Corollary. Every metric space {M,d) is a topological space (M, T). The open balls of 
(M, d) are a base for the topology T. 

10.4.17. Remark. Assertions like the preceding corollary are very common in mathematics. Lit- 
erally the claim made there is nonsense: a set together with a distance function is surely not the 
same thing as a set together with a family of subsets closed under certain operations. On the 
other hand it does say slightly more than, "Every metric space (M, d) induces a topology on M." 
It is taken to mean, additionally, that whenever we encounter a metric space M we will identify 
it with its induced topological space in the sense that references to topological properties — open, 
closed, compact, connected, and so on — of subsets of M will be regarded as meaningful without 
any explanation. In circumstances when you wish to consider a topology on a metric space other 
than the one induced by the metric, you must be quite explicit about what you are doing. 

10.4.18. Example. It is clear from the preceding proposition 10.4.15 that every open ball in a 
metric space is an open set. Furthermore, every closed ball is a closed set and every sphere is a 
closed set. 

10.4.19. Example. The space with its usual (Euclidean) metric is a second countable topo- 
logical space. 

10.4.20. Example. The topology induced by the discrete metric on a set M (see example 10.4.9) 
is the discrete topology on M (see example 10.1.4). 

10.4.21. Example. The topology induced by the usual metric on the set M (see example 10.4.11) 
is the usual topology on M (see example 10.2.6). 

The next proposition shows that as a topological space every metric space is Hausdorff. 

10.4.22. Proposition. If a and b are distinct points in a metric space, then there exists a number 
r > such that Br (a) and Br{b) are disjoint. 

Ahead of us lie many situations in which it will be possible to replace a computationally 
complicated metric on some space by a simpler one without affecting any topological property 
of the space. Below (in proposition 10.4.26) we give a sufficient condition for this process to work. 

10.4.23. Definition. Two metrics di and d2 on a set M are equivalent if they induce the same 
topology on M. 

10.4.24. Definition. Two metrics di and d2 on a set M are strongly equivalent if there exist 
numbers a, /3 > such that 

di{x,y) < ad2{x,y) and 
d2{x,y) < /3di{x,y) 

for all x and y in M. 

10.4.25. Example. On M? the three metrics d, di, and du, defined in examples 10.4.11, 10.4.12, 
and 10.4.13, are all strongly equivalent. 

Hint for proof . First prove that if a, 6 > 0, then 

max{a, 6} < a + 6 < -v/2 \/a^ + 6^ < 2 max{a, b} . 
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10.4.26. Proposition. // d and p are strongly equivalent metrics on a set M , then they are 
equivalent. 

Hint for proof . Use corollary 10.2.4. 

10.4.27. Definition. A metric d on a set M is bounded if sup{(i(x,y): x,y ^ M} < oo. 

10.4.28. Example. Let (M, d) be a metric space. The function p defined on M x M by 

f X d{x,y) 
l + d{x,y) 

is a bounded metric on M. 

u V 

Hint for proof . Show first that < whenever < u < v. 

I + u 1 + v 

10.4.29. Proposition. Every metric is equivalent to a bounded metric. 
Hint for proof . Use corollary 10.2.4. 

10.4.30. Example. In example 10.4.28 take M to be the real line M and d to be the usual metric 
on M (see 10.4.6). Then the metrics d and p are equivalent but not strongly equivalent on M. 

10.4.31. Definition. A subset B of a metric space M is bounded if it is contained in some open 
ball. A function f : S ^ M from a set into a metric space is BOUNDED if its range is a bounded 
subset of M. (In particular, a sequence {xn) of points in a metric space is bounded if its range 
{xn'- n G N} is a bounded subset of the space.) The family of all bounded functions from S into 
M is denoted by B{S,M). 

10.4.32. Example. Let 5 be a nonempty set and (M, p) be a metric space. For f,g£ B{S, M) 
define 

dU^a) = sup{/3(/(a;),5(x)) : x e S] . 
Then d is a metric on B{S, M). 

Proof. The verification of this is not difficult. Establishing the triangle inequality however 
does require some care. To this end suppose that /, 5, and h belong to B{S,M). The inequality 

p{f{x), h{x)) < p{f{x),g{x)) + p{g{x), h{x)) 

obviously holds for all x G S*, and therefore 

snp{p{f{x), h{x)) ■.x£S}< snp{p{fix),g{x)) + p{g{x), /i(x)) : x G S} . (10.1) 

Next notice that 

{p{f{x),g{x)) + p{g{x),h{x)): x e S} Q {p{f{x),g{x)) + p{g{y),h{y)) : x,y G S} . 

Then from exercise 5.2.3 we see that 

sup{p{f{x),g{x)) + p{gix), h{x)) : x S} 

< sup{p{f{x),g{x)) + p{g{y), h{y)) : x, y G 5} . 

Finally, it follows from exercise 6.1.9 that 

sup{p(/(x), 5r(x)) + p{g{y), h{y)) : x,y e S] 

= sup{p(/(x), g(x)) : X G S"} + snp{p{g{y), h{y)) : y G S"} 

Putting inequalities (10.1), (10.2), and (10.3) together produces the desired result 

d{f,h)<d{f,g) + d{g,h). □ 

This example is particularly important in the special cases where M = M or M = C and 
p{p,q) = \P - q\ for p, q e R (or C). Then d is the uniform metric on B{S,R) (or B{S,C) ). 
Notice that example 10.4.13 is a special case of this. (An n-tuple may be regarded as a function 
on the set = {1, . . . , n}. Thus = B{Nn, M).) 



10.5. INTERIORS AND CLOSURES 



79 



10.4.33. Definition. A sequence (xn) in a topological space X is convergent if it is eventually 
in every neighborhood of some point a in X. In this case we say that the sequence (xn) converges 
TO a and write — )• a as n — )• oo. In cases where limits are unique we say that a is the limit 
of the sequence and we use the notation a = lim„^oo Xn- These notations are often shortened to 
x„ — )• o or a = lim Xn ■ 

10.4.34. Proposition. Limits of sequences in metric spaces are unique (if they exist). 

10.4.35. Example. Limits of sequences in topological spaces need not be unique (even when they 
exist). 

10.5. Interiors and Closures 

The preceding section was concerned with an important class of examples of topological spaces — 
the metric spaces. We now return to general topological spaces and investigate a few of their most 
elementary properties. 

10.5.1. Definition. Let j4 be a subset of a topological space X. A point x is an interior point 
of A if some neighborhood of x lies entirely in A. The interior of A, denoted by A°, is the set of 
all interior points of A. That is, 

A° := {x e M: there exists U QM such that x^U (lA}. 

10.5.2. Example. If M has its usual topology and A is the closed interval [0, 1], then A° = (0, 1). 

10.5.3. Example. Let A be the closed unit disk {(x, y) : + < 1} in M^. Then the interior of 
A is the open disk {(x, y) : + < 1}. 

10.5.4. Example. Let ^ = C R^. Then A° = 0. 

10.5.5. Proposition. A subset U of a topological space is open if and only ifU = U° . 

10.5.6. Proposition. Let A and B he subsets of a topological space. 

(a) IfACB, then A° C B° . 

(b) A°° = A°. (As you would expect, means (A°)°.) 

(c) {AnB)° = A°nB°. 

10.5.7. Proposition. Let A be a family of subsets of a topological space. Then 

{f~]Ay C f^{A°: AeA}<z [j{A°: A e A} ^ {[jAy . 

10.5.8. Example. Give examples to show that in the preceding proposition each containment may 
be proper. 

Next we give a sequential characterization of interior in metric spaces. 

10.5.9. Proposition. A point a in a metric space M is in the interior of a subset S of M if and 
only if every sequence in M that converges to a is eventually in S. 

10.5.10. Definition. A point p in a topological space X is an accumulation point of a set 
A Q X every neighborhood of p contains a point of A distinct from p. (We do not require that 
p belong to A.) We denote the set of all accumulation points of Ahy A' . This is sometimes called 
the DERIVED SET of A. The closure of the set A, denoted by A, is AU A'. 

10.5.11. CAUTION. Some authors prefer to work with limit points rather than accumulation 
points. The concepts are similar; a point p is a limit point of a set A if every neighborhood of p 
contains a point of A. Thus if, for example, A = {0} U [1,2] and p = 0, then p is a limit point of 
A but is not an accumulation point of A. The matter is further complicated by the fact that some 
authors, see [73] for example, define limit point as I have defined accumulation point. Additionally, 
the notation A' is occasionally (again see [23]) to denote the complement of a set A. 
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10.5.12. Example. Let have its usual metric and A be [(0, 1) x (0, 1)] U {(2,3)} C M^^ Then 
A' = [0, 1] X [0, 1] and ]4 = ([0, 1] x [0, 1]) U {(2, 3)}. 

The following proposition is a slight refinement of 10.2.8. 

10.5.13. Proposition. 7/^4 C M is hounded above, then sup^ belongs to A. 

10.5.14. Proposition. A subset F of a topological space is closed if and only if F = F. 

10.5.15. Definition. A subset A of a topological space X is dense in X if A = X. 

10.5.16. Example. The rational numbers are dense in the reals. 
Hint for proof . Use exercise 6.1.13. 

10.5.17. Example. The set is a subset of the topological space M?. Every ordered pair of 
real numbers is an accumulation point of since every open ball in M? contains (infinitely many) 
points with both coordinates rational. So the closure of is all of M?. That is, Q2 is 

dense in M . 

10.5.18. Proposition. Let A be a subset of a topological space. Then 

(a) {A°y = Arc. 

(b) {Ar = {Ar. 

The relationship between interiors and closures given in proposition 10.5.18 makes the next two 
results quite routine. 

10.5.19. Proposition. Let A and B be subsets of a topological space. Then 

(a) IfAQB, then ACB. 

(b) J = A . _ _ 

(c) AU B = AU B. 

10.5.20. Proposition. Let A be a family of subsets of a topological space. Then 

f~]A c AeA}<^ \J{A: A £ A} c \Ja. 

10.5.21. Example. Give examples to show that in the preceding proposition each containment 
may be proper. 

As is the case with interiors, closures can be characterized in metric spaces by means of se- 
quences. (See proposition 10.5.9.) 

10.5.22. Proposition. A point b in a metric space M belongs to the closure of a subset S of M 
if and only if some sequence in S converges to b. 

10.5.23. Proposition. In a metric space the closure Br{a) of an open ball is always contained in 
the corresponding closed ball Cr{a). 

10.5.24. Example. The closure of an open ball and the corresponding closed ball need not be 
equal. For example in the reals with the discrete metric i?i(0) is not the same as Ci(0). 

10.5.25. Definition. If ^4 is a subset of a topological space X, then the boundary of A denoted 
by dA is defined to be Z n A^. 

10.5.26. Proposition. If A is a subset of a topological space X, then 

(a) A = A°^dA = AUdA and 

(b) A° = A\dA = A\dA. 
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CONTINUITY AND WEAK TOPOLOGIES 

In this chapter we begin an examination of the category TOP whose objects are topological 
spaces and whose morphisms are continuous functions. 

11.1. Continuity — the Global Property 

Continuity has come a long way. It appeared in the philosophical arguments of Zeno, who was 
born about 490 BC in Elea (now in southern Italy). By the seventeenth century the concept had 
taken on a quite definite mathematical flavor. Leibniz (born in 1646 in Leipzig) formulated his 
lex continui (law of continuity) in quite a few different ways. One version is, "nature never makes 
leaps". Another, more sophisticated, version is (see [29], p. 539) 

When the difference between two instances in a given series or that which is 
presupposed can be diminished until it becomes smaller than any given quantity 
whatever, the corresponding difference in what is sought or in their results must 
of necessity also be diminished or become less than any given quantity whatever. 

It is surely not a great leap from this formulation (from 1687) to the familiar e-5 definition in 
beginning calculus (see remark 11.2.2). Another two and a half centuries of reflection on and 
tinkering with the concept lead to the currently favored (and quite general) definition: a function 
is continuous if the inverse image of every open set is open. 

11.1.1. Definition. A function f : X ^ Y between topological spaces is CONTINUOUS if f*^{U) is 
an open subset of X whenever U is an open subset of Y. We denote by C{X,Y) the family of all 
continuous functions mapping the topological space X into the space Y. For real valued functions 
we usually abbreviate C(X, M) to C{X). 

In order to see that we obtain a category by declaring topological spaces to be objects and 
continuous maps to be morphisms, it is necessary to know that the identity function on a topological 
space is continuous (which is obvious) and that the composite of continuous functions is continuous. 

11.1.2. Proposition. If f : X ^ Y and g: Y ^ Z are continuous mappings between topological 
spaces, then the composite function g o f : X ^ Z is continuous. 

Proof. If C/ C Z, then, since g is continuous, the set g*^{U) is open in Y . Then, similarly, 
{g o Z)"^ = f^{g^{U)) is an open subset of X (see exercise 2.5.4(b)). □ 

Thus the class of topological spaces together with the continuous functions between them con- 
stitute a category. We denote it by TOP. The isomorphisms in this category are traditionally 

called HOMEOMORPHISMS. 

11.1.3. Example. Every constant function between topological spaces is continuous. 

11.1.4. Example. Let y be a subspace of a topological space X. The inclusion map c: Y ^ X is 
continuous. 

11.1.5. Example. Any restriction of a continuous function is continuous. 
Hint for proof . Use example 11.1.4 and proposition 11.1.2. 

11.1.6. Example. Let X be a nonempty set with the discrete topology and Y be an arbitrary 
topological space. Then C{X,Y) = J^{X,Y). That is, every function from X to F is continuous. 
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11.1.7. Example. Let X be a nonempty set with the indiscrete topology and y be a Hausdorff 
topological space. Then the only continuous functions from X to y are the constant functions. 



11.1.9. Example. Assume that we know that the arctangent function and the tangent function 
(restricted to the interval (— f , f ) ) are both continuous. Then it is clear that our construction of 
the topology on the extended real numbers (see example 10.1.23 makes the set of ordinary reals M 
a subspace of M. The collection !B of all open intervals in M is a base for the topology on M. (The 
open intervals in M are of four types: [—oo,b), {a,b), (a, oo], and, of course, M itself.) 

11.1.10. Example. The identity map id]g from the real line with the discrete topology to the real 
line with its usual topology is an example which shows that a bijective morphism in the category 
TOP need not be an isomorphism. That is, continuous bijections between topological spaces need 
not be homeomorphisms. 

11.1.11. Definition. Let {X,'Z) be a topological space. A subfamily S C T is a subbase for the 
topology T if the family of all finite intersections of members of 6 is a base for T. 

We saw in proposition 10.2.5 that not every family of subsets of a set S can be used as a base 
for a topology on S. The situation is much more agreeable in the case of subbases. 

11.1.12. Proposition. Any nonempty family of subsets of a set S is a subbase for some topology 
on S. (If you do not approve of the conventions mentioned in 1.4-10 and 10.1.2, add the hypothesis 
that the family contains and S.) 

11.1.13. Proposition. Let X and Y be topological spaces and f: X ^Y. 

o 

(a) Let ^ be a base for the topology on Y. Then f is continuous if and only if f^{B) C X 
for every B £ 

o 

(b) Let & be a subbase for the topology on Y. Then f is continuous if and only if f^{S) C X 
for every S £ 6. 

11.1.14. Example. The function / : R — t- R: x i— )■ —2x + 3 is continuous. 

Hint for proof . Recall from example 10.2.6 that the set of bounded open intervals is a base for 
the usual topology on R. 

11.1.15. Example. The map x i— t- x^^^ from R into R is continuous. 

11.1.16. Example. The map x i— )• |x| from R into R is continuous. 

11.1.17. Proposition. A function f : X ^ Y between topological spaces is continuous if and only 
if the inverse image of every closed subset ofY is closed in X . 

11.1.18. Proposition. A function f : X ^ Y between topological spaces is continuous if and only 
if f^(A) C f^{A) for every subset A of X . 



11.2.1. Definition. Let X and Y be topological spaces and a £ X . A function / : X — )■ y is 
CONTINUOUS AT a if f^{V) contains a neighborhood of a whenever F is a neighborhood of f{a). 

11.2.2. Remark. If in the preceding definition 55 is a base for the topology on Y, it is easy to see 
that it is sufficient to require that f^iV) contain a neighborhood of a for each set V in ^ which 
contains /(a). It follows that if {M,d) is a metric space, then a function / : X — )• M is continuous 
at a if and only if /*~ (^^(/(a))) contains a neighborhood of a for each e > 0. Put differently: / is 




for X < 
for X > 



. Then / is not a continuous 
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continuous at a if and only if for every e > there exists a neighborhood f7 of a such that x £ U 
impHes d{f{x),f{a)) < e. 

If /: Ml — )• M2 where both Mi and M2 are metric spaces, then / is continuous at a if and only 
if for every e > there exists a 6 > such that 

r{B,{fia)))^Bs{a). 

Another way of saying exactly the same thing: / is continuous at a if and only if for every e > 
there exists 6 > such that d2{f{x), /(a)) < e whenever di{x, a) < 6. 

Specializing further to the case where / is a real valued function of a real variable we have 
the usual beginning calculus definition of continuity at a point: / is continuous at a point a in 
its domain if and only if for every e > there exists S > such that 1/(2;) — /(a)| < e whenever 
|x — o| < 6. 

Of course, the relationship between the local and global properties of continuity is exactly what 
one would expect. 

11.2.3. Proposition. A function f : X ^ Y between topological spaces is continuous if and only 
if it is continuous at each point of X . 

11.2.4. Example. Clearly C \ {0} is a metric space when the distance between two nonzero 
numbers w and z \s \w — z\. The map z 1— )• 1/ z of C \ {0} into itself is continuous. 

11.2.5. Example. The map z ^~z from C into itself is continuous. 

Here is a sequential characterization of continuity for functions between metric spaces. An 
improved version holding for general topological spaces will appear in proposition 16.4.24. 

11.2.6. Proposition. Let M and N he metric spaces. A function f : M ^ N is continuous at a 
point a in M if and only if f[xn) — ?• f{a) in N whenever (xn) is a sequence converging to a in M. 

11.2.7. Proposition. Let X and Y he topological spaces and suppose that Y is Hausdorff. Suppose 
also that f,g:X^Y are continuous functions which agree on a dense subset D of X. (That is, 
suppose that f{x) = g[x) for all x € D.) Then f = g. 

11.2.8. Proposition. Let f : X he a continuous function on a topological space X and a G X. 
V fio-) > 0; then there exist a neighborhood U of a and a number b > such that f{x) > b for 
every x £ U . 

11.3. Uniform Continuity 

11.3.1. Definition. Let {M,d) and {N,p) be metric spaces. A function /: M ^ N is uniformly 
CONTINUOUS if for every e > there exists 6 > such that p{f{x),f{y)) < e whenever x, y £ M 
and d{x, y) < 6. 



11.3.2. Example. The function / 

11.3.3. Example. The function / 

11.3.4. Example. The function g 



M— 7>M: XI— )-3x — 4is uniformly continuous. 
[1,00) — )• M: X I— )• x~^ is uniformly continuous. 
(0, 1] — )• M: XI—)- x~^ is not uniformly continuous. 

11.3.5. Example. Let M be an arbitrary positive number. The function 

/: [0,M] ^ M: X 

is uniformly continuous. Prove this assertion using only the definition of "uniform continuity". 
Then show that the function 

5: [0,00) M: X 1-^ x^ 

is not uniformly continuous. 

11.3.6. Example. The function g: [0, 00) — t- M: x 1— t- y/x is uniformly continuous. 

11.3.7. Proposition. Every uniformly continuous function is continuous. 
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11.4. Weak Topologies 

Here is a situation which frequently crops up in analysis. We are given an indexed family 
(/a)oeA of functions from a set S into a corresponding family {Xa)aeA of topological spaces . 
What we want to do is put a topology on S in such a way that 

(1) each of the functions fa is continuous, and 

(2) if 5 is a function from a topological space Y into S such that the composite functions fa°g 
are continuous, then g is continuous. 



Certainly condition (1) by itself is easy enough to satisfy: just give S the discrete topology. 
The trouble with this solution to our problem is that the discrete topology may be too large to 
satisfy (2). To see this look at a simple example where S is the set of real numbers and there is 
only one function /, the identity map from the set M to the topological space M (with its usual 
topology). Give = M the discrete topology (and denote it by M^). Now consider what happens if 
we take 1" = M (again with its usual topology) and g to be the identity map. 




Clearly the composite of the two identity functions is continuous, but the one on the left of the 
diagram is not. 

So the problem confronting us is to choose a topology for S which is large enough to make each 
fa continuous and at the same time small enough to allow g to be continuous. The solution to this 
problem turns out to be pleasantly simple: use the smallest topology on S which makes the /q's 
continuous. How is this accomplished? Easily: all we need to guarantee is that for every a the 
inverse image under fa of each open subset of Xa is open in S. So take the family of these sets as 
a subbase for a topology on S. 

11.4.1. Definition. Suppose that 5" is a set, that for every a £ A (where A is an arbitrary index 
set) Xa is a topological space, and that fa - S ^ Xa for every a £ A. Let 

6 = {fa^iUa) -.UatXa and a£A}. 

Use the family (3 as a subbase for a topology on S. This topology is called the weak topology 
induced by (or determined by) the functions fa- 

o 

11.4.2. Exercise. In definition 11.4.1 the family {/„ {Ua)'- Ua ^ Xa and a G A} was used for 
a subbase for the weak topology on S. Explain briefly why we could have used a smaller family 
of sets to generate the sa-ine topology. For example, show that if is a base for the topology on 
Xa, then {fa^ {Ba) Ba € 5Sq, and a £ A} is a subbase for the weak topology on S. Show that 
the preceding sentence remains true even if the word "subbase" is substituted for "base" . 

The next two propositions verify that we have actually solved the problem that we started out 
to solve. 

11.4.3. Proposition. Under the weak topology (defined above) on a set S each of the functions fa 
is continuous; in fact the weak topology is the weakest topology on S under which these functions 
are continuous. 

11.4.4. Proposition. Let X be a topological space with the weak topology determined by a family 
J- of functions. Prove that a function g: W ^ X , where W is a topological space, is continuous if 
and only if f o g is continuous for every f £ T . 



11.4.5. Example. Let be a nonempty set and TiS) = J-"(5, M) = M be the vector space of 
all real valued functions on S (see example 4.4.7). The weak topology induced on T{S^ by the 
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family of all evaluation functionals where x € S (see definition 4.5.13) is called the topology of 
pointwise convergence. The reason for this terminology will become clear in example 11.4.8 below. 

11.4.6. Notation. If 5 is a set we denote by FinS" the family of all finite subsets of S. 

11.4.7. Example. Let 5 be a nonempty set and J-{S) be the vector space of all real valued 
functions on S. Then the family of all sets of the form 

U{g, F, e) := {/ G J"(S') : \f{x) - g{x)\ < e whenever x G F} , 

where g G ^(S), F G Fin 5* and e > 0, is a base for the topology of pointwise convergence (see 
example 11.4.5). 

11.4.8. Example. Let 5 be a nonempty set. A sequence (/„) of functions in T{S) converges to a 
function g G J^{S) in the topology of pointwise convergence if and only if fn{x) — )• g{x) for every 
xGS. 

11.4.9. Definition. Let (^a)^^^^^^^ be an indexed family of topological spaces. The weak topology on 
the Cartesian product Yl ^\ induced by the family of coordinate projections vr^ (see definition 8.3.6) 
is called the product topology. 

11.4.10. Proposition. The product of a family of Hausdorff spaces is Hausdorff. 

11.4.11. Remark. Let {Xo,)a£A be an indexed family of topological spaces and g: Y ^ Yl-^a 
be a function from a topological space Y into the product of the X^s. If bas the product 
topology, then according to proposition 11.4.4 g is continuous if and only if tTq, o (7 is continuous for 
each a G A. Thus, for example, a curve c: M — )• M": t 1— )• (ci(t), . . . ,c„(t)) in R" is continuous if 
and only if each of its coordinate functions Ck is. 

11.4.12. Corollary. Let f : W ^ X and g: W ^ Y where W, X, and Y are topological spaces, 
and let h = (/, g). Then h is continuous if and only if f and g are. 

11.4.13. Example. The product X x y of two topological spaces (with the topology defined 
above) is actually a product in the category TOP of topological spaces and continuous maps. 

11.4.14. Example. Let X and Y be topological spaces. Declare a subset U of their disjoint union 
X l±) y to be open if [/ H X is open in X and U OY is open in Y. The family of such sets is a 
topology on X l±) y called the disjoint union topology. The resulting space is a coproduct in 
the category TOP. 

11.5. Subspaces 

In example 10.1.21 we introduced the notion of a subspace of a topological space. Recall that 
a subset y of a topological space X becomes a subspace of X if we equip it with the topology 
induced by X, that is, with the topology whose members are intersections of Y with open subsets 
of X. An equivalent description of this induced topology on Y is that it is the smallest topology 
on y which makes the inclusion mapping y — )• X continuous. Yet another way of saying exactly 
the same thing: it is the weak topology on Y determined by the inclusion map l: y — )• X. 

11.5.1. CAUTION. Be aware that being open (or closed) is not what I will call an intrinsic 
property of a set. A set (with a fixed topology) can be open in one space but fail to be open in 
another. Let's take an obvious example. It is nonsense to ask out of context if the interval (1, 2] is 
an open set. As a subset of [0, 2] (regarded as a subspace of M) it is indeed open. But as a subset of 
M it is certainly not open. It is standard to say U is an open set without further qualification when 
it is clear what topological space is being referred to. But if in some discussion U is contained in a 
subspace y of a topological space X, it must be clear whether the intended meaning is that U is 
open in Y of that it is open in X. 
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In a similar vein, when y is a subspace of a topological space X and j4 C y, some of the 
notation introduced earlier becomes ambiguous. In particular, the notation for closure and interior 
are inadequate. If for example X = M, y = (0, 2), and A = (1, 2), what would A be? If we intend 
the closure of A in the subspace Y , the answer would be [1, 2), while if closure in X were intended 
the answer would be [1,2]. When this sort of ambiguity threatens it is wise to adopt a more explicit 
notation. 

11.5.2. Notation. Let y be a subspace of a topological space X and A C y. We will write cly(A) 
for the closure of A in the space Y and clx(^) for its closure in X. Similarly we write inty(74) for 
the interior of yl as a subset of Y and intx(^) for its interior in X. 

11.5.3. Proposition. Let Y Q X where X is a topological space. A subset A of Y is closed in Y 
if and only if there exists a closed set F in X such that A = Y D F . 

11.5.4. Proposition. If A <^Y C X where X is a topological space, then 

cIy{A) = yncix(^). 

Thus ifY is a closed subset of X, then the closure of a set in Y is the same as its closure in X. 

11.5.5. Proposition. If X is a topological space, V is an open subset of X, and U is an open 
subset of the subspace V , then U is open in X . 

11.5.6. Proposition. If A <^Y Q X where X is a topological space, then intx(^) Q inty(>l). //, 
additionally, Y is an open subset of X, then equality holds. 

11.5.7. Example. Letting A = (0, 1], Y = [0, 1], and X = M shows that equality need not hold in 
general in the preceding proposition. 

It is an interesting problem to decide which properties of a topological space are inherited by 
its subspaces. A property P of topological spaces which holds in every subspace whenever it holds 
in the "parent" space is a hereditary property. A property P is F-hereditary if whenever it 
holds in a topological space X it holds for every closed subset of X (regarded as a subspace of X), 
and it is G-hereditary if it holds for every open subset of X whenever it holds in X. (Memory aid: 
associate F with the French word ferme, which means closed.) 

11.5.8. Example. The property of being Hausdorff is hereditary as is the property of being second 
countable. We will see later that compactness is an F- hereditary property (see proposition 17.1.7) 
and that being a Baire space is G- hereditary (see proposition 29.3.16). 

11.6. Quotient Topologies 

A weak topology on a set S is induced by a family of functions fa - S ^ Xa mapping into 
topological spaces. It is the weakest topology on S which makes all these functions continuous (see 
exercise 11.4.3). A dual notion is that of a STRONG topology, a topology induced by a family of 
functions mapping from topological spaces Xa into S. It is the strongest topology under which all 
these functions are continuous. The only strong topology for which we will have much use is the 
quotient topology. 

11.6.1. Definition. Let X be a topological space, S be a set, and vr: X — t- S* be a surjection. The 
family 

T = {[/ C 5: TT^iU) C X} 
is a topology on S. It is the quotient topology determined by vr. 

11.6.2. Proposition. The family 1. defined above is a topology and the function vr is continuous. 
In fact, 1. is the strongest topology under which vr is continuous. 

11.6.3. Example. With its quotient topology (as defined above) a set S is a quotient object in 
the category TOP and the mapping vr is a quotient map (see definition 8.2.1). 
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11.6.4. Exercise. Let ~ be an equivalence relation on a topological space X. Explain carefully 
how to place a topology on the family X/~ of equivalence classes of X in such a way that X/~ 
becomes a quotient object in the category of topological spaces and continuous maps. 

11.6.5. Theorem (Fundamental Quotient Theorem for TOP). Let ~ he an equivalence relation on 
a topological space X and vr: X — t- X/~ he the associated quotient map. Then for every topological 
space Y and every continuous function f : X ^ Y which is constant on the equivalence classes 
of X there exists a unique continuous function f : X/ ~ — )■ K which makes the following diagram 
commute. 

X 

TT 

XI ^Y 

J 

Furthermore, f is injective if and only if f[x) is different from f{y) whenever x and y are not 
equivalent; and f is surjective if and only if f is. 

11.6.6. Example. In the category TOP not every epimorphism is a quotient map. 

Hint for proof . Consider the identity map on the reals with different topologies on the domain 
and codomain. 

11.6.7. Definition. A mapping f : X ^ Y is said to be open if it takes open sets to open sets; 

o o 

that is, if f~^{U) C Y whenever [/ C X. It is CLOSED if it takes closed sets to closed sets. 

11.6.8. Example. The mapping vr : [0, 1] — ?■ S"*^ : 1 1— ?• e^'^** shows that a quotient map need not be 
open. 

11.6.9. Exercise. Your pal Fred R. Dimm has been considering two topological spaces: the interval 
[0, 1) and the unit circle in the complex plane, each with its usual topology. He has decided 
that the latter is a quotient object of the former and that the function vr : [0, 1) — ?• : t i— t- e^'^** 
is the quotient map. To prove this he considers a function / mapping to a topological space 
X and assumes that / o vr is continuous. Now, he argues, if U is an open subset of X, then 
7r^(/^(C/)) = (/ o 7r)^(C/) is an open subset of [0,1). But according to the definition of the 
quotient topology this means that f^{U) is an open subset of Thus by definition 11.1.1 / is 
continuous. Help Fred to see the error of his ways and show him that vr is not a quotient map. 
(What is the principal difference between this exercise and the preceding example?) 

11.6.10. Example. The usual coordinate projection vri : — )• M: (x, y) i— )• x shows that a quotient 
map need not be closed. 

11.6.11. Proposition. In the category TOP every open epimorphism is a quotient map. 

11.6.12. Proposition. In the category TOP every closed epimorphism is a quotient map. 
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NORMED LINEAR SPACES 

12.1. Norms 

In the world of analysis the predominant denizens are function spaces, vector spaces of real or 
complex valued functions. To be of interest to an analyst such a space should come equipped with 
a topology. Often the topology is a metric topology, which in turn frequently comes from a norm. 
We begin this section with the definition of "norm" and show how norms lead to metrics on vector 
spaces. 

12.1.1. Definition. Let 1^ be a vector space. A function || || : y — t- M: x i— t- ||x|| is a norm on V 
if 

(1) \\x + y\\ < \\x\\ + llyll for all x, y £ V; 

(2) lla^ll = \a\ \\x\\ for all x G F and q G M; and 

(3) if ||x|| = 0, then x = 0. 

(Of course, when F is a complex vector space, (2) should hold for all a G C.) The expression ||x|| 
may be read as "the norm of x" or "the length of x". If the function || || satisfies (1) and (2) above 
(but perhaps not (3) it is a seminorm on V. A vector space on which a norm has been defined is 
a NORMED LINEAR SPACE (or NORMED VECTOR SPACE). A vector in a normed linear space which 
has norm 1 is a UNIT VECTOR. 

12.1.2. Exercise. Show why it is clear from the definition that ||0|| = and that norms (and 
seminorms) can take on only positive values. 

The most important fact is that every normed linear space is a metric space. More precisely, 
a norm on a vector space induces a metric d, which is defined by d{x,y) = \\x — y\\. That is, the 
distance between two vectors is the length of their difference. 




y 

If no other metric is specified we always regard a normed linear space as a metric space under 
this induced metric. Thus every metric (and hence every topological) concept makes sense in a 
(semi)normed linear space. 

12.1.3. Example. Let 1^ be a normed linear space. Define d: V x V ^ M hy d{x,y) = ||x — y||. 
Then d is a metric on V . If V is only a seminormed space, then d is a pseudometric. 

12.1.4. Example. The absolute value function is a norm on M. Obviously, it induces the usual 
metric on the real line (see example 10.4.6). 

12.1.5. Example. For x = (xi,...,x„) E M" let ||x|| = i^^^iXk^Y^"^ ■ The only nonobvious 
part of the proof that this defines a norm on is the verification of the triangle inequality. But 
we have already done this: it is just Minkowski's inequality 6.5.2. This is the usual norm (or 
Euclidean norm) on W^] unless the contrary is explicitly stated, when regarded as a normed 
linear space will always be assumed to possess this norm. It is clear that this norm induces the 
usual (Euclidean) metric on (see example 10.4.11). 
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12.1.6. Example. For x = G let ||x||i = 'Yllt=i\^k\- The function x >->• ||x||i is 
easily seen to be a norm on M". It is sometimes called the 1-norm on M". It induces the taxicab 
metric on (see example 10.4.12). 

The next four examples are actually a single example. The first two are special cases of the 
third, which is in turn a special case of the fourth. 

12.1.7. Example. For x = G M" let \\x\\u = max{|xfc|: 1 < A; < n}. This defines 
a norm on M"; it is the UNIFORM NORM on M". It induces the uniform metric on M" (see exam- 
ple 10.4.13). 

12.1.8. Example. The set loo of all bounded sequences of real numbers is clearly a vector space 
under pointwise operations of addition and scalar multiplication. For x = (xi,X2, • • • ,) S ^oo let 
||x||oo = sup{ : 1 < k}. This defines a norm on l^o] it is the uniform norm on l^o- 

12.1.9. Example. Let 5 be a nonempty set. If / is a bounded real valued function on S, let 

ll/IU :=sup{|/(x)|:xE5}. 

This is a norm on B{S) and is called the uniform norm. Clearly this norm gives rise to the 
uniform metric on B{S) (see example 10.4.32). Notice that examples 12.1.7 and 12.1.8 are special 
cases of this one. (Let S = N„ or 5 = N.) 

12.1.10. Example. It is easy to make a substantial generalization of the preceding example by 
replacing the real line M by an arbitrary normed linear space. Suppose then that S is a nonempty 
set and ^ is a normed linear space. For / in B{S,V) let 

ll/IU :=sup{||/(x)||:xG5}. 

Then this is a norm and it is called the uniform norm on B{S,V). (Why is the word "easy" in 
the first sentence of this example appropriate?) 

12.1.11. Definition. Let S" be a set, y be a normed linear space, and (/„) be a sequence of 
functions in J-{S, V). If there is a function g in J-{S, V) such that 

sup{||/„(x) — ^(x)!! : X G 5} — as n — )■ oo, 

then we say that the sequence (/„) converges uniformly to g and write fn — ^ (7(unif). The 
function g is the UNIFORM LIMIT of the sequence [fn)- Notice that if g and all the /n's belong to 
B{S, V), then uniform convergence of {fn) to g is just convergence of (/n) to g with respect to the 
uniform metric. 

There are many ways in which sequences of functions converge. There is a detailed discussion 
of several of these in chapter 27. Arguably the two most common modes of convergence are uniform 
convergence, which we have just discussed, and pointwise convergence which we have encountered 
in examples 11.4.5 and 11.4.8. 

12.1.12. Definition. Let 5 be a set, V he a. normed linear space, and {fn) be a sequence in 
J-{S, V). If there is a function g such that 

fn{x) — )■ ^(x) for all X G 5, 

then {fn) CONVERGES POINTWISE to g. In this case we write 

fn 5(ptws). 

The function g is the POINTWISE limit of the /n's. 

If {fn) is an increasing sequence of real (or extended real) valued functions and fn^g (ptws), we 
write fn \ 9 (ptws). And if {fn) is decreasing and has 5 as a pointwise limit, we write fnig (ptws). 

The most obvious connection between these two types of convergence is that uniform conver- 
gence implies pointwise convergence. 
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12.1.13. Proposition. // a sequence {fn) in J-{S,V) (where S is a set and V is a normed linear 
space) converges uniformly to a function g in J-{S,V), then {fn) converges pointwise to g. 

The converse is not true. 

12.1.14. Example. For each n E N let /„: [0, 1] — )• M: x i— )• x"^. Then the sequence (/„) converges 
pointwise on [0, 1], but not uniformly. 

12.1.15. Example. For each n G N and each x G let fn{x) = ^x. Then on each interval of 
the form [0, a] where a > the sequence (fn) converges uniformly to the constant function 0. On 
the interval [0, oo) it converges pointwise to 0, but not uniformly. 

A useful observation is that a uniform limit of bounded functions must itself be bounded. 

12.1.16. Proposition. Let S be a set, V be a normed linear space, and (fn) be a sequence in 
B{S, V) and g be a member of J-'{S, V). If fn ^ g (unif), then g is bounded. 

And a uniform limit of continuous functions is continuous. 

12.1.17. Proposition. Let X be a topological space, V be a normed linear space, and [fn) be a 
sequence in C{X,V) and g be a member of J-'{X,V). If fn ^ 9 (unif), then g is continuous. 

Spaces of bounded real finitely additive set functions provide us with more examples of normed 
linear spaces. 

12.1.18. Example. Let 2t be an algebra of subsets of a nonempty set S. For /x E ba(5) let 

||/x|| := sup{|/i(^)|: A G 21} . 
This function is a norm on ba(5), and thus ba(S') is a normed linear space. 

12.1.19. Proposition. If x and y are elements of a vector space V equipped with a seminorm \\ \\, 
then 

I — ||y|| I < — y\\ . 

12.1.20. Corollary. A seminorm on a vector space is uniformly continuous. 

12.1.21. Proposition. If V is a normed linear space, if x £ V , and if r, s > 0, then 

(a) Brio) = -5,(0); 

(b) BrsiO) = rBsiO); 

(c) X + Br{0) = Br{x); and 

(d) Br{0) + -6,(0) = 2Br{0). (Is it true in general that A + A = 2A when A is a subset of a 
vector space?) 

12.1.22. Proposition. Prove that in a normed linear space every open ball is a convex set. And 
so is every closed ball. 

12.1.23. Proposition. Let V be a normed linear space. For each a £V the map Ta'. V ^ V : x 

X + a ( called TRANSLATION by a) is a homeomorphism. 

12.1.24. Corollary. If U is a nonempty open subset of a normed linear space V, then U — U 
contains a neighborhood ofO. 

1 " 

12.1.25. Proposition. If (xn) is a sequence in a normed linear space and Xn ^ a, then —} Xk — )• 

n ^-^ 

k=l 

12.1.26. Proposition. Let V be a vector space with two norms \\ ■ ||i and \\ ■ II2. Let Tfc be the 

topology induced on V by \\ ■ \\k (for k = 1,2). If there exists a constant a > such that \\x\\i < 
q;||x||2 for every x E V, then Ti C T2. 

In definition 10.4.24 we said what it means for two metrics to be strongly equivalent. The 
corresponding condition for norms is called simply equivalence. 
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12.1.27. Definition. Two norms on a vector space V are equivalent if there exist constants a, 
/3 > such that for all x e V 

a||x||i < ||a;||2 < /3||a;||i • 

12.1.28. Proposition. // || ||i and \\ ||2 are equivalent norms on a vector space V, then they 
induce the same topology on V. 

Exercise 12.1.28 gives us an easily verifiable sufficient condition for two norms to induce identical 
topologies on a vector space. Thus, if we are trying to show, for example, that some subset of a 
normed linear space is open it may very well be the case that the proof can be simplified by replacing 
the given norm with an equivalent one. 

Similarly, suppose that we are attempting to verify that a function / between two normed 
linear spaces is continuous. Since continuity is defined in terms of open sets and equivalent norms 
produce exactly the same open sets (see proposition 12.1.28), we are free to replace the norms on 
the domain of / and the codomain of / with any equivalent norms we please. This process can 
sometimes simplify arguments significantly. (This possibility of simplification, incidentally, is one 
major advantage of giving a topological definition of continuity in the first place.) 

12.1.29. Definition. A sequence a = (an) of vectors in a real or complex normed linear space V is 
SUMMABLE if there exists a vector b £ V such that — Sn|| — s-Oasn— s-oo, where Sn = Y12=i '^k is 
the n**^ partial sum of the sequence a. The sequence a is absolutely summable if < oo. 

12.1.30. Exercise. Let li be the set of all absolutely summable sequences of real numbers. Make 
li into a vector space with the usual pointwise definition of addition and scalar multiplication. For 
every a G /i define 

oo 

||a||i := y^lQfcl 
fc=i 

and 

||a||„ := sup{|a/c| : k G N}. 

(These are, respectively, the 1-norm and the uniform norm.) Show that both || ||i and || \\u are 
norms on /i. Then prove or disprove: 

(a) If a sequence (a*) of vectors in the normed linear space (/i,|| ||i) converges, then the 
sequence also converges in {li, \\ 

(b) If a sequence (a*) of vectors in the normed linear space (/i,|| ||m) converges, then the 
sequence also converges in (/i, || ||i). 

12.2. Bounded Linear Maps 

Analysts work with objects having both algebraic and topological structure. In the preceding 
section we examined vector spaces endowed with a topology derived from a norm. If these are the 
objects under consideration, what morphisms are likely to be of greatest interest? The plausible, and 
correct, answer is maps which preserve both topological and algebraic structures, that is, continuous 
linear maps. The resulting category is denoted by NLSqo- The isomorphisms in this category 
are, topologically, homeomorphisms. It is clear that one might choose instead to study a category 
in which the mappings satisfy a stronger condition: the isomorphisms are isometrics, that is they 
preserve distances (equivalently, they preserve norms). In this category, denoted by NLSi, the 
morphisms are CONTRACTIVE LINEAR MAPS, that is, linear maps T: V ^ W between normed 
linear spaces such that ||rx|| < ||x|| for all x £ V. It seems appropriate to refer to NLSqo as 
the topological category of normed linear spaces and to NLSi as the geometric category of normed 
linear spaces. Many authors use the unmodified term "isomorphism" for an isomorphism in the 
topological category NLSqo and "isometric isomorphism" for an isomorphism in the geometric 
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category NLSi. In these notes we will focus on the topological category. The isometric theory of 
normed linear spaces although important is somewhat more specialized. 

12.2.1. Exercise. Verify the unproved assertions in the preceding paragraph. In particular, prove 
that 

(a) NLSoo is a category. 

(b) An isomorphism in NLSqo is both a vector space isomorphism and a homeomorphism. 

(c) A linear map between normed linear spaces is an isometry if and only if it is norm preserv- 
ing. (Definitions. For /c = 1, 2 let T4 be a normed linear space, || ||fc be the norm on Vk, 
dk be the metric on Vfc induced by || and /: Vi — )■ V2. Then / is an isometry (or 
an ISOMETRIC map) if d2{f{x), f{y)) = y) for all x, y £Vi. It is norm preserving 
if ||/(x)||2 = ||x||i for all X £ Vi.) 

(d) NLSi is a category. 

(e) An isomorphism in NLSi is both an isometry and a vector space isomorphism. 

(f) The category NLSi is a subcategory of NLSoo in the sense that every morphism of the 
former belongs to the latter. 

Next we consider a very useful condition which, for linear maps, turns out to be equivalent to 
continuity. 

12.2.2. Definition. A linear transformation T: V ^ W between normed linear spaces is bounded 
if T{B) is a bounded subset of W whenever S is a bounded subset of V . In other words, a bounded 
linear map takes bounded sets to bounded sets. We denote by ^{V, W) the family of all bounded 
linear transformations from V into W . A bounded linear map from a space V into itself is often 
called a (bounded linear) operator. The family of all operators on a normed linear space V 
is denoted by 53 (y). The class of normed linear spaces together with the bounded linear maps 
between them constitute a category. We verify below that this is just the category NLSoo- 

12.2.3. CAUTION. It is extremely important to realize that a bounded linear map will not, in 
general, be a bounded function in the sense of definitions 4.4.13 or 10.4.31. The use of "bounded" 
in these two confiicting senses may be unfortunate, but it is well established. 

12.2.4. Example. The linear map TiM— )-]R:xi— 7>3j;isa bounded linear map (since it maps 
bounded subsets of M to bounded subsets of M), but, regarded just as a function, T is not bounded 
(since its range is not a bounded subset of M) . 

The following observation my help reduce confusion. 

12.2.5. Proposition. A linear transformation, unless it is the constant map that takes every vector 
to zero, cannot he a hounded function. 

One of the most striking aspects of linearity is that for linear maps the concepts of continuity, 
continuity at a single point, and uniform continuity coalesce. And in fact they are exactly the same 
thing as boundedness. 

12.2.6. Theorem. Let T: V ^ W he a linear transformation between normed linear spaces. Then 
the following are equivalent: 

(i) T is continuous at 0. 

(ii) T is continuous on V. 

(iii) T is uniformly continuous on V . 

(iv) The image of the closed unit ball under T is bounded. 

(v) T is bounded. 

(vi) The exists a number M > such that \\Tx\\ < M||a;|| for all x £ V . 

12.2.7. Proposition. Let T: V ^ W be a hounded linear transformation between normed linear 
spaces. Then the following four numbers (exist and) are equal. 
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(i) sup{||rx||: ||x|| < 1} 

(ii) sup{||Tx||: ||x|| = 1} 

(iii) sup{||Ta;|| \\x\\-^ : x / 0} 

(iv) inf{M > 0: ||rx|| < M||x|| for all x e V} 

12.2.8. Definition. If T is a bounded linear map, then ||T||, called the norm of T, is defined to 
be any one of the four expressions in the previous exercise. 

12.2.9. Example. If V and W are normed linear spaces, then the function 

II \\: ^{V,W) ^R: T ^ \\T\\ 

is, in fact, a norm. 

12.2.10. Example. The family ^{V, W) of all bounded linear maps between normed linear spaces 
is itself a normed linear space. 

12.2.11. Example. Let T: ^ M^: {x,y) ^ {3x,x + 2y,x - 2y). Then ||T|| = ^/U. 

12.3. Products of Normed Linear Spaces 

12.3.1. Example. Let V and W be normed linear spaces. On the Cartesian product V xW define 

\\ix,y)h = V¥¥+W, 
||(x,y)||i = ||x|| + ||y||, 

and 

||(a:,y)|U = max{||x||, ||y||} . 

The first of these is the Euclidean norm (or 2-norm) on y x T^, the second is the 1-norm, and 
the last is the uniform norm. Verifying that these are all norms on 1/ x 1^ is quite similar to 
the arguments required in examples 12.1.5, 12.1.6, and 12.1.7. That they are equivalent norms is a 
consequence of the following inequalities and exercise 12.1.28. 

\\x\\ + ||y|| < \/2V||x||2 + ||y||2 < 2max{||x||, ||y||} < 2(||x|| + ||y||) 

(The first inequality is an easy consequence of exercise 6.4.1(d).) 

12.3.2. Convention. In the preceding example we defined three (equivalent) norms on the product 
space V xW. We will take the first of these || ||i as the product norm onV xW. Thus whenever 
V and W are normed linear spaces, unless the contrary is specified we will regard the product V xW 
as a normed linear space under this norm. Usually we write just ||(x,y)|| instead of ||(x, 2/)||i. The 
product of the normed linear spaces V and W is usually denoted by and is called the direct 
sum of V and W. (In the special case where V = W = M, what metric does the product norm 
induce on M^?) 

12.3.3. Proposition. IfV and W are normed linear spaces, then the product topology onV xW 
is the same as the topology induced by the product norm on V (BW . 

12.3.4. Example. Show that product V (BW of normed linear spaces is in fact a product in the 
category NLSoo of normed linear spaces and bounded linear maps. What are the coproducts in 
this category? 

12.3.5. Exercise. Identify the product and coproduct of two spaces V and W in the category 
NLSi of normed linear spaces and linear contractions. 

12.3.6. Exercise. Let {xn, yn) be a sequence in the direct sum y of two normed linear spaces. 
Prove that (x„, y„) converges to a point (a, b) in V (B W and only if Xn ^ a in V and yn ^ b 
in W. 
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12.3.7. Proposition. Addition is a continuous operation on a normed linear space V . That is, 
the map 

A: V ®V ^V: {x,y)^ x + y 

is continuous. 

12.3.8. Exercise. Give a very short proof (no e's or (5's or open sets) that if and are 
sequences in a normed linear space which converge to a and 6, respectively, then a:„ + y„ — )• a + 6. 

12.3.9. Proposition. Scalar multiplication is a continuous operation on a normed linear space V 
in the sense that the map 

S : M. yi V ^ V : {a,x) ^ ax 

is continuous. 

12.3.10. Proposition. If B and C are subsets of a normed linear space and a is a scalar, then 

(a) aB = aB; and 
{h)B + CC W+C. 

12.3.11. Example. If B and C are closed subsets of a normed linear space, then it does not 
necessarily follow that B + C is closed (and therefore B + C and B + C need not be equal). 

12.3.12. Example. Let X be a topological space. Then the family C{X) of all continuous functions 
on X is a vector space under the usual pointwise operations of addition and scalar multiplication. 

12.3.13. Example. Let X be a topological space. We denote by Cb{X) the family of all bounded 
continuous functions on X . It is a normed linear space under the uniform norm. In fact, it is a 
subspace of 13{X) (see example 12.1.9). 

12.3.14. Definition. Let A be a (real) algebra on which a norm has been defined. Suppose that 
additionally the submultiplicative property 

< ||y|| 

is satisfied for all x, y G A. Then A is a (real) normed algebra. (Much later in these notes 
we will also consider complex normed algebras, where the scalars come from C rather than M.) We 
make one further requirement: if the algebra A is unital, then 

||1|| = 1. 

In this case ^ is a UNITAL NORMED algebra. 

12.3.15. Example. In example 12.1.9 we showed that the family B{S) of bounded real valued 
functions on a set S is a normed linear space under the uniform norm. It is also a commutative 
unital normed algebra. 

12.3.16. Proposition. Multiplication is a continuous operation on a normed algebra V in the 
sense that the map 

M: V xV ^V: {x,y) ^ xy 

is continuous. 

Hint for proof . If you can prove that multiplication on the real numbers is continuous you can 
almost certainly prove that it is continuous on arbitrary normed algebras. 

12.3.17. Example. The family Cb{X) of bounded continuous real valued functions on a topological 
space X is, under the usual pointwise operations and the uniform norm, a commutative unital 
normed algebra. 

12.3.18. Example. The family *B(y) of all (bounded linear) operators on a normed linear space 
y is a unital normed algebra. 
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12.3.19. Definition. In 4.5.12 we discussed the algebraic dual of a vector space V. Of great 
importance in normed spaces is the study of the continuous members of V\ the continuous linear 
Junctionals, also known as bounded linear junctionals. The set V* of all such functionals on the 
normed space V is the dual space of y. To distinguish it from the algebraic dual and the order 
dual, it is sometimes called the norm dual or the topological dual of V . 

12.3.20. Example. If S" is a set and a G S, then the evaluation functional 

Ea:B{S)^R:f^f{a) 

is a bounded linear functional on the space B{S) of all bounded real valued functions on S. If S 
happens to be a topological space then we may also regard Ea as a member of the dual space of 
Cb{S). (In either case, what is ||£'a||?) 

12.4. Quotients of Normed Linear Spaces 

In discussing quotients in the category NLSoo we use the notation introduced for quotient 
vector spaces in definition 8.2.8. 

12.4.1. Proposition. Let V be a normed linear space and M be a closed subspace ofV. Then the 
map 

II II : V/M ^ M: [x] ^ inf{||n|| : u ~ x} 
is a norm on V/M . Furthermore, the quotient map 

tt:V ^ V/M : x^[x\ 
is a bounded linear surjection with ||7r|| < 1. 

12.4.2. Exercise. Revisit exercise 8.2.10 and explain how the example given there might motivate 
the definition of the norm on the quotient space V/M in proposition 12.4.1. 

Next we give the analogue of theorems 2.11.9, 8.2.6, and 8.2.11 in the category of normed linear 
spaces and bounded linear maps. 

12.4.3. Theorem (Fundamental quotient theorem for NLSoo)- Let V and W be normed linear 
spaces and M be a closed subspace ofV. IfT is a bounded linear map from VtoW and ker T D M , 
then there exists a unique bounded linear map T: V/M — t- W which makes the following diagram 
commute. 

V 

IT 

V/M 

T 

Furthermore: \\T\\ = \\T\\; T is infective if and only ifkeiT = M; and T is surjective if and only 
if T is. 
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DIFFERENTIATION 



We now pause for a very brief review of differential calculus. The central concept here is 
differentiability. A function / between normed linear spaces is said to be differentiable at a point p if 
(when the point (p, f{p)) is translated to the origin) the function is tangent to some continuous linear 
map. In this chapter we make this precise and record a few important facts about differentiability. 
A more detailed treatment can be found in my ProblemText in Advanced Calculus [17], chapters 
25-29. 

There are two sorts of textbooks on differential calculus: concept oriented and computation 
oriented. It is my belief that students who understand the concepts behind differentiation can 
do the calculations, while students who study calculations only often get stuck. Among the most 
masterful presentations of concept oriented differential calculus are [ l,)] (volume I, chapter 8) and 
[30] (chapter 3). As of this writing the latter book is available without charge at the website of 
one of the authors: 

http : //www . math . harvard . edu/~shlomo/docs/ Advanced_Calculus . pdf 



The material in this chapter will benefit primarily those whose only encounter with multivariate 
calculus has been through partial derivatives and a chain rule that looks something like 

dw dw dx ^ dw dy ^ dw dz 
du dx du dy du dz du 
The approach here is intended to be more geometric, emphasizing the role of tangency. 

13.1. Tangency 

13.1.1. Notation. Let V and W be normed linear spaces and a G V. We denote by J-'a{V, W) the 
family of all functions defined on a neighborhood of a taking values in W. That is, / belongs to 

o 

Ta{V, W) if there exists a set U such that a G U CI dom/ C V and if the image of / is contained 
in W. We shorten J^a{y,W) to J-a when no confusion will result. Notice that for each a € V, 
the set Ta is closed under addition and scalar multiplication. (As usual, we define the sum of two 
functions / and g in Ta to be the function f + g whose value at x is f{x) +g{x) whenever x belongs 
to dom/ n domg.) Despite the closure of Ta under these operations, is not a vector space. 
(Why not?) 

13.1.2. Definition. Let V and W be normed linear spaces. A function / in J^o(KW^) belongs to 
D{V, W) if there exist numbers c > and 6 > such that 

11/(^)11 <c||x|| 

whenever < 5. 

A function / in J^o{V, W) belongs to o{V, W) if for every c > there exists 6 > such that 

11/(^)11 <c||x|| 

whenever < 6. Notice that / belongs to o{V, W) if and only if /(O) = and 

h^O \\h\\ 
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When no confusion seems likely we will shorten 0{V, W) to O and o{V, W) to o. 

13.1.3. Exercise. Here is a list summarizing the important facts about the families D and o. 
State precisely what each of these says and give a proof. (Here 55 is the set of bounded linear maps 
between normed linear spaces V and W, and Co is the set of all functions in J^o{V, W) which are 
continuous at 0.) 



(1) 


OS U c D c Co . 


(2) 


QSn = 0. 


(3) 


+ DCD; aOcO. 


(4) 


O + oCo; aoCo. 


(5) 


o D c . 


(6) 


D o c . 


(7) 


o{V,R)-W ^ o{V,W). 


(8) 


0{V,R) ■ 0{V,W) C oiV,W) 



13.1.4. Definition. Let V and W be normed linear spaces. Two functions / and g in J^Q(y,W) 

are TANGENT (at zero), in which case we write f — g,iif — g & o{V, W). 

13.1.5. Proposition. The relation of tangency enjoys the following properties. 

(a) "Tangency at zero" is an equivalence relation on Fq. 

(b) Let S,T and f eTo- If S f and T ~ /, then S = T. 

(c) If f — g and j ~ k, then f + j — g + k, and furthermore, af 2± cx.g for all a G M. 

(d) Let (/), V' G -^o(^, R) and w G W . // ~ V; ^hen cf>w ~ ipw. 

(e) Letf, g£Tolv,W) andT e^{W,X). If f ^g, then T o f T o g. 

(f) Let h e 0{V, W) and f, ge J'oiW, X). If f ^g, then fohc^goh. 

13.2. The Diflferential 

13.2.1. Definition. Let V and W be normed linear spaces, a eV, and / G J-'a{V, W). Define the 
function A/a by 

A/„(^) := /(a + h)- fia) 

for all h such that a + /i is in the domain of /. Notice that since / is defined in a neighborhood 
of a, the function A/a is defined in a neighborhood of 0; that is. A/a belongs to J-'o{V, W). Notice 
also that A/a(0) = 0. 

13.2.2. Proposition. // V and W are normed linear spaces and a G V, then the function A has 
the following properties. 

(a) Iffe Ta{V, W) and aeR, then 

A{af)a = aAfa. 

(b) Iff,geTa{V,W), then 

A{f + g)a = Afa + Aga. 

(c) If(f)e Ta{V,R) and f e Ta{V,W), then 

^{H)a = 0(a) ■ A/a + A,^a " /(a) + A,^a • A/a . 

(d) If f G Ta{V,W), g e Ff^a){W,X), andgofeTa{V,X), then 

Aigof)a = Ag^^^^oAfa. 

(e) A function f : V ^ W is continuous at the point a in V if and only if A/a is continuous 
atO. 
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(f ) If f : U ^ Ui is a bijection between subsets of arbitrary vector spaces, then for each a in 
U the function A fa ■ U — a ^ Ui — f{a) is invertible and 

13.2.3. Definition. Let V and W be normed linear spaces, a G V, and / € Ta{y,W). We say 
that / is DIFFERENTIABLE AT a if there exists a bounded linear map which is tangent at to A/q. 
If such a map exists, it is called the differential of / at a and is denoted by dfa- Thus dfa is 
just a member of ^{V,W) which satisfies dfa — Afa- We denote by VaiV^W) the family of all 
functions in J-a{V, W) which are differentiable at a. We often shorten this to Va- 

We establish next that there can be at most one bounded linear map tangent to A/q. 

13.2.4. Proposition. Let V and W be normed linear spaces and a G F. If f G Va{y,W), then 
its differential is unique. 

13.2.5. Exercise. Let 

/ : ^ : (x, y, z) ^ [x^y - 7, 3xz + 4y) 

and a = (1,-1,0). Use the definition of "differential" to find dfa- Hint. Work with the matrix 
representation of dfa- Since the differential must belong to *B(M^,M^), its matrix representation is 

a 2 X 3 matrix ^ ~ ^ y ■ Use the requirement that \\Afa{h) — Mh\\ — as /i — > to 

discover the identity of the entries in M. 

13.2.6. Exercise. Let F: be defined by F{x,y) = {y,x'^,4.- xy,7x), and let p = (1, 1). 
Use the definition of "differentiable" to show that F is differentiable at p. Find the (matrix 
representation of the) differential of F at p. 

13.2.7. Proposition. Let V and W be normed linear spaces and a £ V . If f £ T>a, then Afa G O; 
thus, every function which is differentiable at a point is continuous there. 

13.2.8. Proposition. Let V and W be normed linear spaces and a G V. Suppose that f,g£ 
T>aiV, W) and that a G M. Then 

(1) af is differentiable at a and 

d{af)a = adfa] 

(2) also, f + g is differentiable at a and 

d{f + g)a = dfa + dga . 

Suppose further that 4> G I^a{y, Then 
(c) (Pf£Va{V,W) and 

d{<Pf)a = dcpa ■ f{a) + (t){a) dfa . 

It seems to me that the version of the chain rule given in (13.1), although (under appropriate 
hypotheses) a correct equation, really says very little. The idea that should be conveyed is that 
the best linear approximation to the composite of two smooth functions is the composite of their 
best linear approximations. 

13.2.9. Theorem (The Chain Rule). Let V , W , and X be normed linear spaces with a ^ V . If 
f£Va{V,W) andgGVfi^a){W,X), then g o f eVa{V,X) and 

d{g o f)a = dg^^^^ o dfa ■ 
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Proof. Our hypotheses are Afa ^ dfa and A(7j(„) ~ dg^f^^)- By proposition 13.2.7 A/a G D. 
Then by proposition 13.1.5(f) 

^9f{a)°^fa^dgf(^a)°^fa (13.2) 

and by proposition 13.1.5(e) 

dgf(a)° ^fa^dgf(^^)odfa. (13.3) 

According to proposition 13.2.2(d) 

A(5o/),~ A5^(a)oA/a. (13.4) 
From (13.2), (13. 3), (13. 4), and proposition 13.1.5(a) it is clear that 

^{g° f)a ^ dgf(^a) ° dfa- 

Since ^^^/(a) ° dfa is a bounded Hnear transformation, the desired conclusion is an immediate con- 
sequence of proposition 13.2.4. □ 

13.2.10. Exercise. Derive (under appropriate hypotheses) equation (13.1) from theorem 13.2.9. 

13.2.11. Exercise. Let T be a linear map from M" to M"" and p G R". Find dTp. 

13.2.12. Example. Let T be a symmetric n x n matrix and let p € M". Define a function 
/: M" ^ M by /(x) = (Tx,x). Then 

d/p(h) = 2(rp,h) 

for every h G M". 
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RIESZ SPACES 

Ordered vector spaces are a bit too general for our purposes. A much richer theory results 
when we add the requirement that these spaces be lattices. The resulting objects are called Riesz 
spaces. Perhaps the best introduction to these spaces is the book [^'d] by Adriaan Zaanen, one of 
the pioneers in the field. Two other excellent introductory treatments are Chapter 7 of [1] and 
Chapter 35 of [' ]. For more advanced treatments see [3], [31], [33], and [40]. 

14.1. Definition and Elementary Properties 

14.1.1. Definition. A Riesz space (or vector lattice) is an ordered vector space in which 
every pair of elements has a supremum and an infimum. As usual we use the notation x \/ y for 
sup{x, y} and x A y for inf{x, y}. If x is an element of a Riesz space, the element x+ := x V is 
the POSITIVE PART of X and x^ := —{x A 0) is the NEGATIVE PART of x. (Notice that just as the 
imaginary part of a complex number is a real number, the negative part of a Riesz space vector is 
a positive vector in the space.) 

14.1.2. Proposition. An ordered vector space V is a Riesz space if and only if xV exists for 
each X £ V. Furthermore, in any Riesz space the following formulas always hold: 

(a) X V y = X + (y — x)+; and 

(b) xAy = y-{y- x)+. 

14.1.3. Example. The real line R is a Riesz space. 

14.1.4. Example. The ordered vector space with its usual ordering (see example 9.3.6) is a 
Riesz space. 

14.1.5. Example. The ordered vector space M? with lexicographic ordering (see example 9.3.7) is 
a Riesz space. 

14.1.6. Example. If 5 is a nonempty set the family J-{S) of all real valued functions on S (see 
example 9.3.11) is a Riesz space under the usual pointwise partial ordering. 

14.1.7. Example. If 5 is a nonempty set the family B{S) of bounded real valued functions on S 
(see example 4.4.14) is a Riesz space under the usual pointwise partial ordering. 

14.1.8. Example. If X is a nonempty topological space the family C{X) of continuous real valued 
functions on X (see example 12.3.12) is a Riesz space under the usual pointwise partial ordering. 

14.1.9. Theorem (Jordan Decomposition Theorem). The positive cone of a Riesz space is gener- 
ating. In particular, if E is a Riesz space and x £ E, then 

X = x^ — x~ . 

Hint for proof . Add (a) and (b) of proposition 14.1.2. 

14.1.10. Proposition. If x is a vector in a Riesz space, then x^ A x^ = 0. 

The Jordan decomposition is not a unique way of decomposing an element of a Riesz space into 
the difference of two positive elements, but it is "minimal" in the following sense. 

14.1.11. Proposition. // an element x of a Riesz space can be written in the form x = a — b where 
a and b are positive, then x~^ < a and x~ < b. 
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14.1.12. Proposition. // an element x in a Riesz space is written in the form x = a — b where 
a A 6 = 0, then a = x~^ and b = x~ . 

14.1.13. Proposition. For vectors x and y in a Riesz space the following hold: 

(a) = (— x)+ and = {—x)'; 

(b) X < y if and only if x'^ < y'^ and x~ > y~ ; 

(c) {x + y)^ < x^ + y^ and (x + y)~ < x" +y~ . 

14.1.14. Notation. If x is an element of a Riesz space let 

|x| := X V {—x) . 

14.1.15. Proposition. If x is an element of a Riesz space, then \x\ = x^ + x~ . 

Interesting and important examples of Riesz spaces can be found among the spaces of bounded 
real finitely additive set functions. We examine these in items 14.1.16-14.1.20. 

14.1.16. Example. Let 21 be an algebra of subsets of a nonempty set S. Given G ba(5) define 
a function o" on 2t by 

a{A) = s^lY>{^l{B) : A D S G 21} . 
Then o" is a bounded (real) finitely additive set function. 

14.1.17. Example. If S" is a nonempty set, then the ordered vector space ba(5) (see 9.4.10) is a 
Riesz space. 

Hint for proof . Show that the function a defined in the preceding example is the least upper 
bound of ^ and 0. 

14.1.18. CAUTION. An especially interesting aspect of the spaces ba(5) is that suprema, infima, 
and absolute values are in general not evaluated pointwise. (See, for example, 14.1.20.) 

14.1.19. Proposition. Let 21 be an algebra of subsets of a set S. If fi, v £ ba(S'), then 

(a) {fi V u){A) = sup{/i(S) + u{C) : A = S W C} and 

(b) in A i^){A) = mf{fi{B) + u{C) : A = B iS C} 

for every j4 G 21. 

14.1.20. Example. Let S = Z n [—5, 5] (that is, the integers between —5 and 5) and 21 = *P(5). 
Then 21 is an algebra of sets. Define /x: 21 — )• M by setting n{A) to be the sum of the elements in 
A for each nonempty AOS. (Also take ^(0) = 0.) Then /j, belongs to the Riesz space ba(5). For 
A = {-1,0,1} we see that (^ V0)(^) / fi{A) V 0{A) and |^|(^) / |/i(^)|. 

The following proposition gives a few formulas which are useful when working with Riesz spaces. 
You can find a much more extensive list (43 formulas, in fact) in [ ' pages 55-57. 

14.1.21. Proposition. Let E be a Riesz space and x, y £ E. Then 

(a) — |x| < X < |x| = |— x| 

(b) |x + y| < |x| + \y\ 

(c) I |x| — |y| I < |x — y\ 

(d) xVy= l{x + y + \x-y\) 

(e) X Ay = ^{x + y -\x - y\) 

(f ) {x\/ y) + {x Ay) = X + y 

(g) (xVy) -{xAy) = \x - y\ 

(h) |x| V \y\ = l{\x + y\ + \x - y\) 

(i) |x| A |y| = ^ I |x + y| — |x — y| I 
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14.1.22. Proposition. Let A be a subset of a Riesz space E and suppose that sup^ exists. Then 
for every b & E 



14.1.23. Remark. In studying lattices one often is told that some proposition holds "by duality". 
This particular form of duality can be regarded as a metamathematical principle derived from 
the observation that the operations V and A appear symmetrically in the definition of "lattice". 
Notice that if {S, <) is a partially ordered set, then so is (5"°^, ^) where x ^ y holds in 5*°^ if and 
only if y < X in S". The partially ordered set (^"p, ^) is the dual (or opposite) of {S, <). Any 
proposition in (S, <) becomes a proposition of (5"°^, ^) if we reverse the order relation (change 
every occurrence of < to ^, which is more commonly written as >) and interchange V and A, lower 
bound and upper hound, minimum and maximum, sup and inf, and so on. In Riesz spaces and, 
more generally, lattices, where there is by definition a symmetry between < and >, propositions 
remain true when the interchanges mentioned above are made. (And so do the proofs! ) Thus for 
example the two assertions of proposition 5.4.11 are dual to one another. It is standard to prove 
one of them and say that the other holds by duality. Don't be surprised if in a text or paper you 
find the dual of a proposition used without the author even mentioning duality. 

14.1.24. Exercise. Duality like any other tool is susceptible of misuse. Explain carefully why 
"duality" cannot be invoked to argue that proposition 5.2.9 is obviously true. What is the dual 
assertion of 5.2.9? 

14.1.25. Exercise. What is the dual of proposition 14.1.22? 

14.1.26. Definition. Let F be a vector subspace of a Riesz space E. If x\/ y (and x Ay) belong to 
F whenever x and y do, then F is a Riesz subspace of E. BEWARE: the condition is not that 
X and y have a supremum in F, but that their supremum in E belongs to F. (See the following 
example.) 

14.1.27. Example. The family of polynomial functions on the interval [0, 1] provide an example 
of a linear subspace of a Riesz space C([0, 1]) which is not a Riesz subspace of C([0, 1]). 

14.1.28. Proposition. Let F be a vector subspace of a Riesz space E. In order for F to be a Riesz 
subspace of E it is sufficient that x V (the supremum of x and in E) belong to F whenever x 
does. 

Hint for solution. This follows easily from exercise 14.1.2. 

14.1.29. Proposition. If a, h, and c are positive elements of a Riesz space, then 



Hint for proof . Let x = o A (6 + c). Show that x — c is dominated by both a and b. Then show 
that X — (a A 6) is dominated by both a and c. 

14.1.30. Proposition (Riesz decomposition lemma). If E is a Riesz space and x, y, u, and v are 

members of E^ such that x + y = u + v, then there exist a, b, c, and d G E'^ such that 



(sup A) f\b = sup{a f\b: a e A] . 



a A (6 + c) < (a A 6) + (a A c) . 



X 



y 



a + b 
c + d 



u 



a + c 



V 



b + d 
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In other words, given x, y, u, and v as above, we can find positive "summands" which satisfy the 
following addition table. 



+ 


b 


c 


a 


X 


u 


d 


V 


y 



Hint for proof . Let a = x Au. The hardest part is showing that d is positive. Show first that 
6 A c = 0. Then use the fact that 6 < f + c to write b as b A {c + v). 

14.1.31. Notation. If a and b are elements of an ordered vector space V, let 

[a,b] := {x £ V : a < X < b}. 
(Obviously, [a, b] is empty unless a < b.) This is the order interval between a and b. 

14.1.32. CAUTION. Notice that we are using the same notation for order intervals that we 
use for closed intervals on the real line and closed segments in vector spaces (see the cautionary 
note 9.2.3). It should be clear from context which is intended. 

14.1.33. Proposition. If E is a Riesz space and x, y £ , then 

[0,x] + [0,y] = [0,x + y]. 

14.1.34. Definition. A sequence {xn) in a partially ordered set S is increasing if Xm < Xn 
whenever m < n. It is decreasing if Xm > Xn whenever m < n. A sequence is monotone if it is 
either increasing or decreasing. 

If (xn) is an increasing sequence in S and a = sup{x„}, we write Xn t Similarly, Xn i a 
means that the sequence (xn) is decreasing and a = inf{a;„}. In either case we say that the sequence 
converges and that the convergence of (x„) to a is monotone. 

14.1.35. Definition. A Riesz space E is Archimedean if n^^x | whenever x G E^ . 

14.1.36. Example. The lexicographically ordered plane is not Archimedean. Recall that the 
lexicographic ordering on is defined by 

{u, v) < (x, y) if and only if either u < x or else u = x and v < y. 

14.1.37. Example. Every cr-Dedekind complete Riesz space is Archimedean. 

14.2. Riesz Homomorphisms and Positive Operators 

14.2.1. Definition. A linear map T: E ^ F between Riesz spaces which is also a lattice homo- 
morphism is a RiESZ HOMOMORPHISM. 

14.2.2. Proposition. For a linear map T: E ^ F between Riesz spaces the following are equiva- 
lent: 

(a) T{x y y) = Tx V Ty for all x,y £ E. 

(b) T is a Riesz homomorphism. 

(c) T{x+) = (Tx)+ for all x £ E. 

(d) T{x Ay) = Tx A Ty for all x,y £ E. 

(e) Tx A Ty = in F whenever x Ay = in E. 

(f) \Tx\ = T\x\ for all x £ E. 

14.2.3. Example. The class of Riesz spaces and the accompanying Riesz homomorphisms form a 
category. It will be denoted by RS. 
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14.2.4. Proposition. In the category RS of Riesz spaces and Riesz homomorphisms, every bijec- 
tive morphism is an isomorphism. 

14.2.5. Definition. A linear map T: V ^ W between ordered vector spaces is positive if Tx € 
whenever x G . (Notice that T is positive if and only if it is order preserving.) As one 

would expect positive members of the algebraic dual (that is, positive linear maps from V into M) 
are called positive linear functionals on V. 

14.2.6. Proposition. Every Riesz homomorphism between Riesz spaces is positive. 
But not every positive linear map is a Riesz homomorphism. 

14.2.7. Example. On the Riesz space i3([0,2]) of bounded real valued functions on the interval 
[0, 2] consider the map T: B{[0,2]) ^R: f ^ f{x)dx. This map is positive but is not a Riesz 
homomorphism. (If g = x^q ^ and h = 2] ' then T{g V h) ^ Tg V Th.) 

14.2.8. Example. Together the class of Riesz spaces and the positive linear maps between them 
constitutes a category. We will denote it by RSP. 

14.2.9. Example. In the category RSP of Riesz spaces and positive linear maps a bijective 
morphism need not be an isomorphism. For example, the identity mapping from M? with its usual 
ordering (see examples 9.3.6 and 14.1.4) to M? with lexicographic ordering (see examples 9.3.7 
and 14.1.5) is a positive bijection but not an isomorphism in the category RSP. 

By definition, a mapping T: E ^ F between Riesz spaces is a Riesz isomorphism if it is 
bijective and both T and are Riesz homomorphisms. 

14.2.10. Proposition. A linear bijection between Riesz spaces is a Riesz isomorphism if and 
only if both T and T^^ are positive. In other words, the categories RS and RSP have the same 
isomorphisms. 

14.2.11. Proposition. Let E and F be Riesz spaces and suppose that F is Archimedean. Then 
every additive map S: E^ — t- F~^ extends uniquely to a positive linear map T: E ^ F. 

Hint for proof . Recall that a map / between vector spaces is additive if f{x+y) = f{x) + f{y) 
for all X and y. When you try to prove additivity of the extension (whose definition is probably 
obvious), keep in mind that (x + y)"*" is not the same thing as x"*" + y"*". Show that \i x = u — v 
where u, v G E~^ , then Tx = Su — Sv. For the homogeneity of the extension you may want to 
prove that Sx < Sy whenever < x < y and that S{rx) = rS{x) for positive rational numbers r 
and X G E^ . 

14.2.12. Definition. Let S,T:V^Whe linear maps between ordered vector spaces. We say 
that T dominates (or majorizes) 5" on F if S{x) < T{x) for ah x £ V^. 

14.2.13. Proposition. Let E be a Riesz space, F a Dedekind complete Riesz space, and G a 
Riesz subspace of E. Suppose that Sq: G ^ F and T : E ^ F are positive and linear and that T 
dominates Sq on G. Then Sq can be extended to a positive linear operator S on all of E which is 
dominated by T. 

14.2.14. Definition. A linear map T: V ^ W between ordered vector spaces is regular if it 
is the difference of positive linear maps. We denote by ^riV, W) the family of all regular (linear) 
maps from y to VF. 

14.2.15. Proposition. A linear map T: E ^ F between ordered vector spaces is regular if and 
only if it is dominated by a positive linear map. 
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14.3. The Order Dual of a Riesz Space 

14.3.1. Definition. A subset of an ordered vector space V is ORDER BOUNDED if it is bounded 

above and bounded below by elements of V. 

14.3.2. Proposition. Let E be a Riesz space. 

(a) An element x belongs to the order interval [—u,u] (where u G E~^) if and only if \x\ < u. 

(b) A subset A of E is order bounded if and only if there exists u G E'^ such that \a\ < u for 
all a ^ A. 

14.3.3. Definition. A linear map T: V ^ W between Riesz spaces is order bounded if it 
takes order bounded sets to order bounded sets; that is, if T{A) is an order bounded subset of 
W whenever A is an order bounded subset of V. We denote by ^) the family of all order 
bounded (linear) maps from V to W. 

14.3.4. Proposition. IfT:V^Wisa linear map between ordered vector spaces, then the 
following are equivalent: 

(i) T is order bounded; 

(ii) T takes order intervals into order intervals; and 

(iii) T takes order intervals of the form [0, x] into order intervals. 

14.3.5. Proposition. Every positive linear map between Riesz spaces is order bounded. 

14.3.6. Proposition. Every regular linear map between ordered vector spaces is order bounded. 

14.3.7. Proposition. If E and F are Riesz spaces and F is Dedekind complete, then £,r{E,F) = 
2biE,F). 

14.3.8. Proposition. // E and F are Riesz spaces and F is Dedekind complete, then F) is 
a Dedekind complete Riesz space with the set of positive operators from VtoWas its positive cone. 

14.3.9. Notation. We denote by the family of all order bounded linear functionals on a Riesz 
space E. This is the ORDER DUAL of E. 

14.3.10. Proposition. If E is a Riesz space, then so is its order dual E"^ . 

Hint for proof. Use the set of all positive linear functionals as the positive cone. To show that 
the supremum of an order bounded linear functional / and the zero functional exists, define g^ on 
E+ by 

50 (x) = sup{/(n) : ue [0,x]}. 

14.3.11. Corollary. Every order bounded linear functional on a Riesz space is the difference of 
two positive linear functionals. 

14.3.12. Proposition. Let E be a Riesz space and f G Then for every x G E'^ 

(a) f+{x) = sup{f{u):u G [0,x]}; 

(b) f~{x) = sup{— /(u): u G [0, x]}; and 

(c) 1/1(2;) = sup{/(u): \u\ < x}; 

14.3.13. Proposition. Let E be a Riesz space and f G E^ . Then for every x E E 

(a) \f\{\x\) = sup{/(m): \u\ < \x\} and 

(b) \f{x)\ < \f\i\x\). 

14.3.14. Proposition. Let E be a, Riesz space and f,g£ E"^ . Then for every x G E^ 

(a) (/ V g){x) = sup{/(m) + g{x — u) : it G [0, x]} and 

(b) (/ A g){x) = inf{/(tx) + g{x-u):ue [0, x]}. 

14.3.15. Proposition. The order dual of a Riesz space is always Dedekind complete. 
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Hint for proof . Start with a nonempty subset A of the order dual that is bounded above. Show 
that without loss of generality we may assume that all the elements of A are positive. For each 
positive element x of the Riesz space let h{x) be the supremum of the set of all numbers s{x) where 
s is a finite supremum of elements of A. Show h = sup A. 
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MEASURABLE SPACES 

15.1. fj-Algebras of Sets 

15.1.1. Definition. A nonempty family 21 of subsets of a nonempty set S is a ct-algebra if it is 
closed under complements and countable unions. That is, we require that 

(1) If OS is a countable subfamily of 21, then |J *B S 21; and 

(2) If ^ G 21, then £ 21. 

If 21 is a cT-algebra of subsets of a set S, then we say that the pair {S, 21) is a measurable space. 
When it is thought that no confusion will result, this is often shortened to "S* is a measurable 
space." When A belongs to 21, we may say that A is a measurable subset of S and write ACS 
(rather than ^ G 21). Compare this with the definitions given for algebras of sets and topologies 
given in 5.6.11 and 10.1.1. 

15.1.2. Exercise. We have just defined a cr-algebra to be a (nonempty) family of sets which is 
closed under taking of complements and countable unions. Show that this is equivalent to defining 
a cr-algebra to be a (nonempty) family of sets which is closed under taking of complements and 
countable intersections. 

15.1.3. Exercise. Let 5^ be a family of subsets of a nonempty set S. Explain carefully what we 
mean by "the smallest a-algebra of sets containing y or "the cr-algebra generated by ^. Prove 
that such a thing exists. (See remark 5.3.6.) 

15.1.4. Notation. If S" is a set and 5^ is a family of subsets of S, let <t^(5) (or, if no confusion 
will result, ct(5^)) be the cr-algebra of subsets of S generated by ^ (that is, the smallest cr-algebra 
of subsets of S which contains 5)- 

It is an interesting, if somewhat unpleasant, aspect of cr-algebras that there appears to be no 
good constructive method of describing the smallest such algebra containing a given family of sets. 
In many cases it is possible to keep generating new sets by alternately taking countable unions and 
complements of sets we already have. As an illustration of the resulting complexity you may wish 
to consider the family ^ of bounded open intervals in M. The smallest cr-algebra containing ^ is 
called the family of Borel sets in M. It is entertaining to try and find a subset of M which is not 
a Borel set. Also notice how difficult it would be to describe an arbitrary Borel set in terms of 
successive unions and complements of the intervals in ^. We will discuss Borel sets in more detail 
in the next section 15.2. 

15.1.5. Example. Let S be an infinite set and 5^ = {{x}: x £ S}. Then asi'S) comprises all 
countable and cocountable subsets of S. (A subset yl of a set S is COCOUNTABLE if its complement 
is countable.) 

15.1.6. Example. Let 5 = [0, 1) and 21 be the family of all subsets of S which are finite unions of 
sets of the form [a, b) where a, 6 € S. Then 21 is an algebra of sets but not a cr-algebra. 

15.1.7. Proposition. Let f : S — )• T be a function from a measurable space (S, 21) into a set T. If 
5S = {B C T: f^{B) E 2t}, then (T, *B) is a measurable space. 

In the next result we make use of the notation '^OA where 5 is a family of sets and yl is a set. 
This means what you probably think it means: 

dnA = {FnA: F G J}- 
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15.1.8. Proposition. Let ^ be a family of subsets of a set S and A S. Then 

aA{dnA) = as{d)nA. 

Hint for proof . It is clear that (7^(^(1 A) C as{^) D A. To establish the reverse inclusion let 
& = {B G crs{d) ■ B n A G CTAid H A)}. Show that © is a u-algebra which contains 5- This is all 
we need, for then crs{S) C ©. In his lovely text [4] Ash labels this the good sets principle. A set in 
f^siS) is "good" if its intersection with A belongs to dAid <^ A). So <5 is the family of all "good" 
sets. All we are required to show is that every member of (Ts{d) is "good". 

15.1.9. Proposition. Let S and T be sets, $ : S" T, and 5" C ^{T). Then 

15.1.10. Exercise. Show how to use propositionl5.1.9 to give a very easy proof of proposi- 
tion 15.1.8. 

15.2. Borel Sets 

In a topological space the most important cj-algebra is the family of Borel sets, the cx-algebra 
generated by the open sets. 

15.2.1. Definition. Let {X, 1) be a topological space. The Borel sets of X are the sets in cr(T). 
Denote the family of Borel sets in X by !Bor(X). 

15.2.2. Example. Every countable and every cocountable subset of M is a Borel set. 

15.2.3. Definition. A nonempty family 21 of subsets of a set S is a Dynkin system if it satisfies 
the following conditions: 

(i) S E 21; 

(ii) if ^, B e 21 and yl C B, then S \ ^ e 21; and 

(iii) if (An) is an increasing sequence of sets in 21 with B = U^i ^n, then G 21. 

15.2.4. Proposition. A nonempty family of subsets of a set S is a a-algebra if and only if it is a 
Dynkin system which is closed under finite intersections. 

15.2.5. Exercise. Let S = {a,b,c,d} be a four-element set. Find a Dynkin system in S which is 
not an algebra of sets. 

15.2.6. Proposition. Let ^ be a Dynkin system in a set S and ^ be a nonempty family of sets 
which is contained in^. If ^ is closed under finite intersections, then cr{^) C 21. 

15.2.7. Definition. A nonempty family Tl of subsets of a set S" is a monotone class provided 
that it satisfies the following condition: if (An) is a sequence of sets in 9Jl such that either An t B 
or An i B, then B £ Tl. 

15.2.8. Proposition. Every a-algebra is a Dynkin system and every Dynkin system is a monotone 
class. 

15.2.9. Proposition. // 21 is an algebra of subsets of a set S, then ct(21) is the smallest monotone 
class in S which contains 21. 

15.2.10. Corollary. An algebra of sets is a a-algebra if and only if it is a monotone class. 

15.2.11. Proposition. If X is a topological space, then the family ?Bor(X) of Borel sets in X is 
the smallest Dynkin system containing the topology of X. 

15.2.12. Definition. A subset of a topological space X is an F^j subset of X if it is a countable 
union of closed subsets of X. It is a Gs subset of X if it is a countable intersection of open 
subsets of X. One way of keeping these definitions straight is to associate the F in F(j with the 
word "ferme " , which is French for closed and the a with "Summe" , which is German for sum. 
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The connection here is that some older monographs use "sum" for "union" (or more specificahy, 
"disjoint union") of sets and "product" for "intersection". Perhaps this makes the "algebra of 
sets" look more algebraic. Also, you may wish to associate the G in Gs with the German word 
" Gebiet, which means region (suggesting open) and the 6 with "Durchschnitt" , which is German 
for intersection. 

15.2.13. Example. Every Gs subset of a topological space X is a Borel subset of X; so is every 
Fa- subset. 

15.2.14. Proposition. A subset of a topological space is an set if and only if its complement 
is a Gs set. 

15.2.15. Example. In a metric space every closed set is a Gs set; and every open set is an F^ set. 

Hint for proof. Let C be a closed subset of a metric space. Consider the function x i— t- d{x, C) 
defined on M. 

15.2.16. Definition. Let / be a real valued function on a topological space X, a be a point in X, 
and iia be the family of all neighborhoods of a. We define the oscillation ujf{a) of / at a by 

Uf{a) := inf { sup{|/(x)-/(y)|}}. 

15.2.17. Proposition. A real valued function f on a topological space is continuous at a point a 
in its domain if and only if ujf{a) = 0. 

15.2.18. Proposition. Let f be a real valued function on a topological space X. The set D{f) of 
points at which f is discontinuous is an F^ subset of X. 

15.3. Measurable Functions 

15.3.1. Definition. Let (5,21) and (T,5S) be measurable spaces. A function f : S ^ T is (21,53)- 
measurable if the inverse image under / of every measurable set is measurable; that is, if f^{B) G 
21 whenever G 5S; equivalently, if f^{B) C S whenever B C T. When no confusion will result, 
we say just that the function is measurable. The family of all measurable functions from S into 
T is denoted by M{S,T). In the frequently occurring case when T = M we shorten A^(S', M) to 
MiS). 

15.3.2. Convention. Whenever a topological space is referred to as a measurable space, without 
further specification, it is understood that the measurable sets are the Borel sets. Thus if (5, 2t) is a 
measurable space, a function / : 5 — t- M is said to be measurable (or, for emphasis, Borel measurable) 
if it is (21, 'Bor(]R))-measurable. Thus for real valued functions measurable means Borel measurable 
unless the contrary is explicitly stated. 

15.3.3. Example. On a measurable space the characteristic function of every measurable set is 
measurable. 

15.3.4. Proposition. The composite of any two measurable functions is measurable. (That is, if 
f : R ^ S and g: S ^ T are measurable functions between measurable spaces, then g o f : R ^ T 
is measurable.) 

15.3.5. Proposition. Let f : S ^ X be a function from a measurable space S into a topological 

o 

space X. Then f is measurable if and only if f^{U) is measurable whenever [/ C X. 
Hint for proof . Proposition 15.1.9. 

15.3.6. Corollary. Every continuous function between topological spaces is (Borel) measurable. 
That is, if X and Y are topological spaces and f : X ^ Y is continuous, then the inverse image 
under f of every Borel set in Y is a Borel set in X . 
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15.3.7. Proposition. If S is a measurable space, then a real valued function f on S is measurable 
if and only if ( (a, oo) ) is measurable for every a E M. 

15.3.8. Proposition. If S is a measurable space and f and g are real valued measurable functions 
on S, then the function (/, g): S ^M.'^ is measurable. 

In the next proposition the codomain of a function / is M^. For /c = 1, 2 we define f^ = ° f 
where vTfc is the k^^ projection function, vrfc: — t- M: {xi,X2) i— >• x^- (See definition 8.3.6.) We 
refer to and as the coordinate functions of /. 

15.3.9. Proposition. Let S be a measurable space. A function f : S ^ M? is measurable if and 
only if its coordinate functions f^ and are. 

15.3.10. Proposition. Let S be a measurable space. If f : S — )■ M is measurable and p > 0, 
then \f\P is measurable. ///: — )• M is measurable and is never zero, then the function 1/f is 
measurable. 

15.3.11. Example. The family A4.{S) of all real valued measurable functions on a measurable 
space is a Riesz space (under the usual pointwise operations). 

15.3.12. Example. The family M{S) of all real valued measurable functions on a measurable 
space S is an algebra (under the usual pointwise operations). 

15.3.13. Proposition. A complex valued function on a measurable space is measurable if and only 
if its real and imaginary parts are. 

15.3.14. Proposition. Let f be an extended real valued function defined on a measurable space 
(5,21). If f^ { (a, oo]) G 21 for every a G M, then f is measurable. 

Hint for proof . Use proposition 15.1.7 and example 11.1.9. 

15.3.15. Proposition. Let (fn) be a sequence of measurable real valued functions defined on a 
measurable space S. Then \l fn, f\fn, liminf /„, and limsup/„ are all measurable. Thus if fn 

g (ptws), then g is measurable. 

15.3.16. Exercise. Let S = [0, 1] x [0, 1] and 21 be the cr-algebra of subsets of S generated by sets 
of the form U x V where either U or is countable and either V or is countable. Does the 
diagonal D = {{x, x) : < x < 1} belong to 21? 

Hint for solution. If E and F are subsets of S and F is uncountable, say that E is large in F 
if F\E is countable and that E is SMALL IN F ifFnE is countable. Let A = {{x,l-x): < x < 1}. 
Consider the family ^ of all subsets of S which are either large in both A and D or else small in 
both A and D. 

15.4. Functors 

15.4.1. Definition. If A and B are categories a COVARIANT functor F from A to B (written 

A 3- B) is a pair of maps: an OBJECT MAP F which associates with each object 5 in A an 

object F{S) in B and a morphism map (also denoted by F) which associates with each morphism 
/ G moxls, r) in A a morphism F{f) G 9JIor(F(5), F{T)) in B, in such a way that 

(1) F{g o /) = F{g) o F{f) whenever g o f is defined in A; and 

(2) F{ids) = idpi^s) fo^ every object S in A. 

F 

The definition of a CONTRAVARIANT functor A B differs from the preceding definition 

only in that, first, the morphism map associates with each morphism / G 5[Ror(5, T) in A a 
morphism F{f) G Tlox{F (T) , F (S)) in B and, second, condition (1) above is replaced by 
(!') F{g o /) = F{f) o F{g) whenever g o f is defined in A. 

15.4.2. Example. Let 5" be a nonempty set. 
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(a) The power set ^{S) of S partially ordered by C is an order complete lattice. 

(b) The class of order complete lattices and order preserving maps is a category. 

(c) For each function / between sets let ^(/) = /~^. Then *p is a covariant functor from 
the category of sets and functions to the category of order complete lattices and order 
preserving maps. 

(d) For each function / between sets let ^(/) = Then is a contravariant functor from 
the category of sets and functions to the category of order complete lattices and order 
preserving maps. 

(e) In parts (c) and (d) can the phrase "order preserving maps" be replaced by "lattice 
homomorphisms" ? 

15.4.3. Example. Let X and Y be topological spaces and (p: X ^ Y he continuous. Define Ccp 
on C{Y) by 

C(t>{g) = go4> 

for all g G C{Y). Then the pair of maps X i— )■ C{X) and (j) i— )• C(j) is a contravariant functor from the 
category of topological spaces and continuous maps to the category of unital algebras and unital 
algebra homomorphisms. 

15.4.4. Example. For every set S let be the family of all cj-algebras of subsets of 5. If (/) : 5 — >• T 
where S and T are sets, define 4>^ on by 

(/)'"(2l) = {ECT: ^^(E) e 21}. 

(a) S"^ is an order complete lattice. 

(b) (j)" maps S"" into T". 

(c) The pair of maps S S"^ , (p (p'^ is a covariant functor from the category of sets and 
maps to the category of order complete lattices and order preserving maps. 

15.4.5. Exercise. Use example 15.4.4 to give a short proof of proposition 15.3.5. 

15.4.6. Example. For each bounded linear map T: V ^ W between normed linear spaces define 

T*:W*^V*:g^goT. 

The map T* is the adjoint of T. Then the pair of maps V V* and T i— )• T* is a contravariant 
functor from the category NLSoo of normed linear spaces and bounded linear maps into itself. We 
will refer to it as the duality functor on NLSoo- 

15.4.7. Example. If C is a category let be the category whose objects are ordered pairs of 

/ 

objects in C and whose morphisms are ordered pairs of morphisms in C. Thus if A and 

are morphisms in C, then {A,B) — ^{C,D) (where {f,g){a,b) = (/(a), (7(6)) for all 

a G A and 6 G -B) is a morphism in C^. Composition of morphism is defined in the obvious way: 

if^g) ° = (/ o h,g° j)- We define the diagonal functor C " > by D{A) := {A, A). 

This is a covariant functor. 
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THE RIESZ SPACE OF REAL MEASURES 

Locally compact spaces (in topology), positive measures (in analysis), and algebras (in operator 
theory) present a common pedagogical problem. In the study of each there is an irresistibly attrac- 
tive special case (compact spaces, real measures, and unital algebras) with properties so pleasant 
and structure so pellucid that one feels almost compelled to present the special case first. But 
here is the problem. By far the most important example of a locally compact space is the real 
line, which is not compact; the most important measure is Lebesgue measure, which is not a real 
measure; and many algebras central to operator theory, the compact operators and the continuous 
functions on M which vanish at infinity, for example, fail to be unital. My solution to the problem 
is to give in to temptation. In these notes we look at real measures in this chapter and Lebesgue 
measure later (in chapter 18). In chapter 17 compact spaces come first, locally compact spaces at 
the end. And in chapter 21 unital algebras are emphasized. 

16.1. Real Measures 

16.1.1. Definition. Let (5, 2t) be a measurable space. A complex (or real) valued function /j, on 

21 is COUNTABLY ADDITIVE if 

oo oo 

fi{B) = ^fi{Ak) whenever B = \^ Ak . (16.1) 

k=l k=l 

A countably additive function /i: 21 — )■ M is a real measure and a countably additive function 
from 21 into C is a COMPLEX measure. Notice that the series Yl'h=i f^i^k) is absolutely convergent, 
since any rearrangement of the series will have the same sum ^{B). 

In case the function /i maps 21 into either (— oo, co] or [— oo, oo) we will say that it is countably 
ADDITIVE if it satisfies (16.1) and also /i(0) = 0. (This last condition excludes the uninteresting case 
of set functions whose value on every set is infinite.) A countably additive set function : 21 — t- [0, oo] 
is a POSITIVE measure; and many authors call a countably additive function from 21 into either 
(—00,00] or [—00,00) a SIGNED measure. Care must be taken to distinguish between positive 
measures and positive real measures: the former may take on infinite values, the latter may not. 

A real measure space is a triple (5,21,^) where (5*, 21) is a measurable space and /x is a real 
measure on 21. Similarly, a complex (respectively, positive or signed measure space is a triple 
(5, 2t, /x) where (5, 21) is a measurable space and /i is a complex (respectively, positive or signed) 
measure on 21. 

In an effort to simplify technical exposition and remove notational clutter authors introduce 
many variants of the preceding terminology. Instead of the expression, "(S", 21, //) is a real measure 
space", you will regularly find in the literature (including these notes) such replacements as "/i 
is a real measure on S"', "5 is a real measure space", and "the real measure space (S, /i)". It is 
not unreasonable to hope that these shortened forms, despite being utterly illogical, will cause no 
confusion. Also I should point out that in many treatments the word measure means what is here 
called positive measure. 

We denote by ca(S', 21) the family of all real measures on a measurable space (5,21). When no 
confusion appears likely we write just ca(5). Similarly, for the family of complex measures on the 
measurable space S we write ca(5, 21, C) or just ca(5, C). 
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16.1.2. Example. Let 5 be a set, 5 E 5, and 21 = For every ^ e 21 define 



Sb{A) 



0, if 6 ^ A. 



Then (5, 21, 5b) is a positive real measure space and the positive real measure 5b is called DiRAC 

MEASURE CONCENTRATED AT b. 



16.1.3. Example. Let 5 be a nonempty set and 21 = ^{S). For every ^ G 21 define 

card A, if A is finite 
oo, if A is infinite. 



Then (S, 21, n) is a positive measure space and the positive measure /U is called COUNTING MEASURE 
on S. If S is finite, then is a positive real measure. 

16.1.4. Notation. If (An) is a sequence of subsets of a set S, then we write An '[ B to indicate 
that the sequence (An) is increasing (that is, An C An+i for each n G N) and that U^i = B. 
Similarly, An i B means that the sequence (An) is decreasing {An 5 An+i for each n G N) and 
that n^i = B. The next result says, roughly, that real measures are a-order continuous 
{sequentially order continuous) from below and from above. 

16.1.5. Proposition. Let S he a measurable space, {An) be a sequence of measurable subsets of S, 
and ^ be a real measure on S. 

(a) If An t B, then fi{An) fi{B). 

(b) If An i B, then ^{An) ^ fi{B). 

The following partial converse of this result is occasionally useful in simplifying the task of 
proving that certain set functions are countably additive. 

16.1.6. Proposition. Let ^ be a real finitely additive set function on a measurable space (5,21). 
Then the following are equivalent: 

(a) jj, is countably additive; 

(b) if the sets An are measurable and An i 0, then fi{An) — )• 0; 

(c) if the sets An are measurable and An t B, then /i(A„) — )• fJ,{B). 

m 

16.1.7. Proposition. Let ^ be a positive real measure on a measurable space S. If An ^ S for 

every n G N and B = lim^„, then 

/x(S) =limfi{An). 
Hint for proof. Consider the sets Bn ■= flfcln and Cn := UfeLn ^^k- 

The next proposition proves that ca(5') C ba(S'). The definition of the function a here will 
look formally like the definition in 14.1.16. The principal difference is that in the earlier example 
we were working with merely an algebra of sets and a function // which was assumed to be finitely 
additive and bounded. Here the domain is a u-algebra of sets; we assume that /u is countably 
additive and prove that it is bounded. 

16.1.8. Proposition. Every real measure is bounded. (That is, if fi is a real measure on a mea- 

m 

surable space (S", 21), then there exists a number M > such that \fj,{A)\ < M for all A C S.) 
Hint for proof. For each measurable subset ^ of S" define 

m 

a{A) := sup{fi{B) : B Q A} . 

Proving that cr{S) < oo shows that /i is bounded above. This may be accomplished in three steps. 
First show that if ^{A) = oo and if A = B^tlC, then either cr{B) = oo or cr(C) = oo. Next show that 
if cr{A) = oo, then there exists a decomposition A = B ^ C such that \fJ^{B)\ > 1 and cr{C) = oo. 
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(To do this show that if a subset of A is chosen to have measure greater than |^(^)| + 1, then 
its complement in A has measure greater than 1 in absolute value.) Finally, obtain inductively a 
disjoint family sets Bi, B2, ■ ■ ■ with \fi{Bn)\ > 1 for each n. Conclude that /i is bounded above. 

16.1.9. Proposition. //(S", 21) is a measurable space, then ca(S) is a vector subspace of the Riesz 
spaceha{S) (see example 14.1-17). 

We now turn to the problem of showing that the ordered vector space ca(5') is also a lattice 
(and therefore a Riesz space). According to exercise 14.1.2 it suffices to show that /x V exists in 
ca(S') for each real measure fi. The function a defined in 16.1.8 above is a real valued set function. 
We must show next that it is countably additive and is, in fact, the supremum of {/x, 0} in ca(5). 
Once we have obtained the desired expression for /i"*" = ^ V 0, it is an easy matter to derive similar 
expressions for and ^ f\v. (Compare this with 14.1.19.) 

16.1.10. Example. Let (5*, /u) be a measurable space. Then ca(S) is a Riesz space. Furthermore, 
if G ca(S'), then the following formulas hold for all measurable subsets A of S. 

(a) ti+{A) = sup{ii{B) : B CA}. 

(b) fi-{A) = -m^{^^{B)■. B C A}. 

(c) ip V i^){A) = sup{/x(B) + u{C) : A = B\SC}. 

(d) Ifi A i^){A) = mf{n{B) + i/(C) : A = B\SC}. 

Hint for proof. Let A = l+l^x-Bfc. Half of showing that the set function a (defined in 16.1.8 
above) is countably additive is proving YlT=i^i^k) < ^{^)- This is facilitated by the following 
procedure: given e > 0, choose for each k G N a set Ck Bk such that 

li{Ck) > a{Bk) - 

Look at /i(l+J^i Ck). 

16.2. Ideals in Riesz Spaces 

16.2.1. Definition. A subset yl of a Riesz space E is solid if x belongs to A whenever |x| < \a\ 
for some a G A. A solid linear subspace of E is an order ideal in E. 

16.2.2. Proposition. If x belongs to a solid subset A of a Riesz space, then \x\, x~^ , and x" all 

belong to A. 

16.2.3. Proposition. Every order ideal in a Riesz space E is a Riesz subspace of E. 

16.2.4. Example. Consider the Riesz space i3([0, 1]) of bounded real valued functions on the 
interval [0, 1]. The set C([0, 1]) of continuous functions on [0, 1] is a Riesz subspace of but not an 
order ideal in ;S([0, 1]). 

16.2.5. Example. Let X be a topological space. A linear subspace C of C{X) is a Riesz subspace 
of C{X) if and only if |/| belongs to C whenever / does. 

We can improve on example 16.1.10: not only is ca(S') a Riesz space contained in ba(S'), it is 
an order ideal in ba(S'). 

16.2.6. Example. The family ca(S) of all real measures on a measurable space (5*, 21) is an order 
ideal in the Riesz space ba(5'). 

Hint for proof. To show that ca(S') is solid it suffices to show that if 1/ G ca(S'), if < /i < i^, 
and if An t B, then ii{An) — ^ A*(-B). Explain. 

16.2.7. Proposition. If A is a nonempty solid subset of a Riesz space E and A'^ is closed under 
addition, then A is an order ideal in E. 

16.2.8. Proposition. // Ji and J2 are order ideals in a Riesz space E, then Ji + J2 is also an 
ideal and (Ji + J2)^ = Ji^ + ^2^- 
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Hint for proof. Use the Riesz decomposition lemma 14.1.30. 

16.2.9. Proposition. The kernel of every Riesz homomorphism between Riesz spaces is an order 
ideal. 

16.2.10. Proposition. Let J be an order ideal in a Riesz space E. If a and (3 are elements of the 
quotient vector space Ej J , then the following are equivalent: 

(a) There exist vectors x G a and y £ (3 such that x < y; 

(b) For every x £ a there exists y £ f3 such that x < y; and 

(c) For every x £ a and for every y £ (3 there exists j £ J such that y — x > j. 

16.2.11. Proposition. Let J be an ideal in a Riesz space E. In the quotient vector space Ej J 
define 

\x\ < [y] if and only if 3a £ [x] and 3b £ [y] such that a < b . 

This defines a partial ordering on E/J under which it becomes a Riesz space and the map n: E ^ 
E/J: X I—)- [x] is a Riesz homomorphism. 

The space E/J is the quotient Riesz space of by J and vr is the quotient map in the 
category RS of Riesz spaces and Riesz homomorphisms. The map vr is also the quotient map in 
the category RSP of Riesz spaces and positive linear maps. 

16.2.12. Example. The quotient map defined above is in fact a quotient map in the categorical 
sense (see definition 8.2.1) and, consequently, the quotient Riesz space is a quotient object in the 
category RS. 

16.2.13. Theorem (Fundamental quotient theorem for RS and RSP). Let E and F be Riesz 
spaces and J be an order ideal in E. Show that if T is a Riesz homomorphism from E to F and 
kerT 5 J, then there exists a unique Riesz homomorphism T: E/J^F which makes the following 
diagram commute. Show also that if T is a positive linear map from E to F and ker T D then 
there exists a unique positive linear map T: E/J — )• F which makes the diagram commute. 

E 

IT 

Y 

E/J ^F 

T 

Furthermore, in either case, T is injective if and only if ker T = J; and T is surjective if and only 
if T is. 

16.2.14. Example. Let A be a nonempty subset of a Riesz space. The smallest order ideal 
containing A, also known as the order ideal generated by A, is the intersection of all order ideals 
containing A. It will be denoted by Ja- The order ideal generated by a single element a of a Riesz 
space is the principal order ideal containing a. It will be denoted by Ja. 

16.2.15. Definition. Let X be a topological space. A linear map T: C{X) —?- C{X) is a multipli- 
cation operator if there exists a continuous function ^: X — )■ M such that {Tf ){x) = (l){x)f{x) 
for every / £ C{X) and every x £ X. 

16.2.16. Example. Let X be a topological space. If T: C{X) — )• C{X) is a linear transformation 
which maps every order ideal of C{X) into itself, then T is a multiplication operator. 

Hint for proof. Consider the image under T of the constant function 1 on X. 
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16.3. Bands in Riesz spaces 

16.3.1. Definition. An order ideal J in a Riesz space E \s a BAND \{ ^ ^ J whenever ^ C J and 
d. = sup A exists in E. 

16.3.2. Example. In the Riesz space C[a, b] the set {/: /(a) = 0} is an order ideal but not a band. 

16.3.3. Example. In the Riesz space C[a, h] the set of all functions / such that f{x) = for every 
X G [0, ^] is a band. 

16.3.4. Example. If S is a measurable space, then the family ca(S') of real measures on 5 is a 
band in the Riesz space ba(5) of all bounded finitely additive set functions on S. 

Proof. See [ '], theorem 9.55. 

16.3.5. Example. Let j4 be a nonempty subset of a Riesz space. The smallest band containing 
A, also known as the hand generated by A, is the intersection of all bands containing A. It will be 
denoted by B^. The band generated by a single element a of a Riesz space is the principal band 
containing a. It will be denoted by Ba- 

16.3.6. Proposition. Let J be an order ideal in a Riesz E, then the band generated by J is the 
set of all X & E which satisfy 

\x\ = sup A for some nonempty set A C J+ . 

Proof. See [ ], Theorem 7.8. In [3], Theorem 3.4, the proposition is stated in terms of nets, 
which we discuss in the next section 16.4. 

16.3.7. Example. If Bi and B2 are bands in a Riesz space, then Bi + B2 need not be a band. 
(Compare this with exercise 16.2.8.) 

Hint for proof. Look in C([— 1, 1]). 

16.4. Nets 

Nets (sometimes called generalized sequences) are useful for characterizing many properties 
of both topological spaces and Riesz spaces. They are used in fundamentally the same way as 
sequences are used in metric spaces. 

16.4.1. Definition. A preordered set in which every pair of elements has an upper bound is a 

DIRECTED SET. 

16.4.2. Example. If is a set, then Fin 5* is a directed set under inclusion C. 

16.4.3. Definition. Let 5 be a set and A be a directed set. A mapping x: A — )■ 5 is a net in S 

(or a net of members of S). The value of x at A G A is usually written x^ (rather than x{X)) and 
the net x itself is often denoted by i^^) x^a j^®* (^a)' 

16.4.4. Example. The most obvious examples of nets are sequences. These are functions whose 
domain is the (preordered) set N of natural numbers. 

16.4.5. Definition. A net x = {x^) is said to eventually have some specified property if there 
exists Ao G A such that x^ has that property whenever A > Aq; and it has the property frequently 
if for every Aq G A there exists A G A such that A > Aq and x^ has the property. (Compare these 
definitions with the ones given for sequences in 7.1.3 and 7.1.6.) 

16.4.6. Definition. A net x in a topological space X converges to a point a G A if it is 
eventually in every neighborhood of a. In this case a is the limit of x and we write x^ — > a or 
just x^ — 7- a. When limits are unique we may also use the notation lim^eA ~ more simply, 
lim x^ = a. 

The point a in A is a cluster point of the net x if x is frequently in every neighborhood of a. 
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16.4.7. Example. The sequence (1 — n ^) converges in M. 

16.4.8. Example. Let X be a topological space and a £ X. A BASE for the family of neighborhoods 
of the point a is a family iBa of neighborhoods of a with the property that every neighborhood of 
a contains at least one member of *Ba. In general, there are many choices for a neighborhood base 
at a point. Once such a base is chosen we refer to its members as basic neighborhoods of a. 

Let fBa be a base for the family of neighborhoods at a. Order 5Ba by containment (the reverse 
of inclusion); that is, for U, V & set 

U ii and only if [/ D y . 

This makes 55a into a directed set. Now (using the axiom of choice) choose one element from 
each set U G Q5a- Then (xjj) is a net in X and x^ a. 

The next two propositions assure us that in order for a net to converge to a point a in a 
topological space it is sufficient that it be eventually in every basic (or even subbasic) neighborhood 
of a. 

16.4.9. Proposition. Let ^ be a base for the topology on a topological space X and a be a point 
of X. A net (xx) converges to a if and only if it is eventually in every neighborhood of a which 
belongs to *B. 

16.4.10. Proposition. Let 6 be a subbase for the topology on a topological space X and a be a 
point of X. A net (xx) converges to a if and only if it is eventually in every neighborhood of a 
which belongs to &. 

16.4.11. Example. Let J = [a,b] be a fixed interval in the real line, x = {xo,Xi, . . . ,Xn) be an 
(n + l)-tuple of points of J, and t = {ti, . . . , t„) be an n-tuple. The pair (x; t) is a PARTITION WITH 

SELECTION of the interval J if: 

(a) Xk-i < Xk ior 1 < k < n; 

(b) tk G [xk-i,Xk] ioT l<k<n; 

(c) xo = a; and 

(d) Xn = b. 

The idea is that x partitions the interval into subintervals and tk is the point selected from the k^^ 
subinterval. If x = {xq, xi, . . . , Xn), then {x} denotes the set {xq,xi, . . . , Let P = (x; s) and 
Q = (y; t) be partitions (with selections) of the interval J. We write P =4 Q and say that Q is a 
REFINEMENT of P if {x} C {y } . Under the relation ^ the family *P of partitions with selection on 
J is a directed (but not partially ordered) set. 

Now suppose / is a bounded function on the interval J and P = (x; t) is a partition of J into 
n subintervals. Let Axk := Xk — Xk-i foi 1 < k < n. Then define 

n 

SfiP) :=^/(tfe)Axfe. 

k=l 

Each such sum Sf{P) is a RiEMANN SUM of / on J. Notice that since ^ is a directed set, Sf 
is a net of real numbers. If the net Sf of Riemann sums converges we say that the function / is 
RiEMANN INTEGRABLE. The limit of the net Sf (when it exists) is the RiEMANN INTEGRAL of / 

over the interval J and is denoted by f{x) dx. 

16.4.12. Example. In a topological space with the indiscrete topology every net in the space 
converges to every point of the space. 

As the preceding example shows nets in general topological spaces need not have unique limits. 
A space does not require very restrictive assumptions however to make this pathology go away. For 
limits to be unique it is sufficient to require that a space be Hausdorff. Interestingly, it turns out 
that this condition is also necessary. 
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16.4.13. Proposition. If X is a topological space, then the following are equivalent: 

(a) X is Hausdorff. 

(b) No net in X can converge to more than one point. 

(c) The diagonal in X x X is closed. 

(The DIAGONAL of a set S is {(s, s) : s e S} <^ S x S .) 

16.4.14. Example. Recall from beginning calculus that the n}^ partial sum of an infinite series 
YlT=i^k defined to be s„ = Yl^=i^k- The series is said to converge if the sequence of 
partial sums converges and, if the series converges, its sum is the limit of the sequence (sn) of 
partial sums. Thus, for example, the infinite series Yl'k^=i converges and its sum is 1. 

The idea of summing an infinite series depends heavily on the fact that the partial sums of 
the series are linearly ordered by inclusion. The "partial sums" of a net of numbers need not be 
partially ordered, so we need a new notion of summation. 

16.4.15. Definition. Let A C M. For every F £ Fin A define Sp to be the sum of the numbers 
in F, that is, Sp = J2 ^- Then S = {^f) p^p^^j^ is a net in M. If this net converges, the set A of 
numbers is said to be summable; the limit of the net is the SUM of A and is denoted by ^ A. 

16.4.16. Definition. A net (xx) of real numbers is increasing if A < implies xx < x^. A net 
of real numbers is BOUNDED if its range is. 

16.4.17. Example. Let f{n) = 2"" for every n G N, and let x: Fin(N) — )■ M be defined by 

(a) The net x is increasing. 

(b) The net x is bounded. 

(c) The net x converges to 1. 

(d) The set of numbers \, jq, ■ ■ ■} is summable and its sum is 1. 

16.4.18. Example. In a metric space every convergent sequence is bounded. Give an example to 
show that a convergent net in a metric space (even in M) need not be bounded. 

It is familiar from beginning calculus that bounded increasing sequences in M converge. The 
preceding example is a special case of a more general observation: bounded increasing nets in M 
converge. 

16.4.19. Proposition. Every bounded increasing net {^x)xeA '^f ^^^^ numbers converges and 

limx^ = sup{x^ : A G A} . 

16.4.20. Example. Let (xk) be a sequence of positive real numbers and A = {xk- k € N}. Then 
(as defined above) is equal to the sum of the series Y2T=i ^k- 

16.4.21. Example. The set : A; S N} is not summable and the infinite series 'Yl'k=i i does not 
converge. 

16.4.22. Example. The set |(— 1)'^| : k G N} is not summable but the infinite series Yl'k'=i^~^)^ \ 
does converge. 

16.4.23. Proposition. Every summable subset of real numbers is countable. 

Hint for proof. If j4 is a summable set of real numbers and n G N, how many members of A 
can be larger than 1/n? 

In proposition 16.4.13 we have already seen one example of a way in which nets characterize a 
topological property: a space is Hausdorff if and only if limits of nets in the space are unique (when 
they exist). Here is another topological property — continuity at a point — that can be conveniently 
characterized in terms of nets. 
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16.4.24. Proposition. Let X and Y be topological spaces. A function f : X ^ Y is continuous 
at a point a in X if and only if f{x^ — )• /(a) in Y whenever (x^) is a net converging to a in X. 
{Cf. exercise 11.2.6.) 

16.4.25. Example. Let y be a topological space with the weak topology (see definition 11.4.1) 
determined by an indexed family {fa)aeA of functions where for each a £ A the codomain of fa 
is a topological space Xa- Then a net {yx)x^j<^ ™ ^ converges to a point a G F if and only if 
faiVx) foi" every a e A. 

16.4.26. Example. If 1" = Hae^ ■^o' ^ product of nonempty topological spaces with the product 
topology (see definition 11.4.9), then a net (y^) in Y converges to a E y if and only if (y^)^ — )• Oo 
for every a £ A. 

16.4.27. Example. Let X and Y be nonempty topological spaces. Recall that the set Y-^ = 
F{X, Y) of all functions from X into y is a product (see example 8.3.7). Give it the usual product 
topology (see definition 11.4.9). A net (/^) of functions in Y-^ converges to a function g G Y^ if 
and only if f)^{x) — )• g{x) for each x € X. Thus the product topology on Y^ is usually called the 

TOPOLOGY OF POINTWISE CONVERGENCE. 

In metric spaces we characterized interiors and closures in terms of sequences. For general 
topological spaces we must replace sequences by nets. 

16.4.28. Proposition. A point a in a topological space X is in the interior of a subset S of X if 
and only if every net in X that converges to a is eventually in S. Hint. One direction is easy. In the 
other use exercise 16.4-9 and the sort of net constructed in example 16.4-8. ( Cf. proposition 10.5.9.) 

16.4.29. Proposition. A point b in a topological space X belongs to the closure of a subset S of X 
if and only if some net in S converges to b- ( Cf. exercise 10.5.22.) 

16.4.30. Proposition. A subset A of a topological space is closed if and only if the limit of every 
convergent net in A belongs to A. 

Nets (as well as sequences) are also useful in characterizing many properties of Riesz spaces. 

16.4.31. Definition. A net (xA)AeA in a Riesz space E is increasing if A < /i in A implies 
x\ < x^ in E. This is denoted by {x\) t- We write xa t ^ to indicate that the net {x\) is increasing 
and a = supjxA : A G A}. Decreasing nets and the notation x\ \, a are defined similarly. 

16.4.32. Proposition. Let {x\) and {y\) be nets (whose domains are the same directed set K) in 
a Riesz space. Show that if xx I a and y\ \, b, then xx + yx I a + b. 

16.4.33. Proposition. Let (xx) be a net in a Riesz space such that xx i a. If a > 0, then axx i eta 
( and if a < 0, then axx '[ aa). 

16.4.34. Proposition. Let (xx) and (yx) be nets (whose domains are the same directed set A) in 
a Riesz space. Show that if xx i a and yx i b, then xx^ yx i aV b and xx ^ yx i a ^b. 

16.4.35. Exercise. State and prove the facts about sequences which correspond to the facts about 
nets in the three preceding propositions. Also state and prove the dual of each of these propositions. 

16.4.36. Proposition. A Riesz space E is a-Dedekind complete if and only if every increasing 
sequence (xn) in E^ which is bounded above has a least upper bound. 

16.4.37. Proposition. A Riesz space E is Dedekind complete if and only if every increasing net 
(xa) in E^ which is bounded above has a least upper bound. 

16.4.38. Example. Let 21 be an algebra of subsets of a nonempty set S. Then the Riesz space 
ba(5) (see example 14.1.17) is Dedekind complete. 

Proof. See [31], example 23.3(v) or [49], example 12.5(iv). 
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16.4.39. Example. If {S, 21) is a nonempty measurable space, then ca(5) is a band in the Riesz 
space ba(5). 

Proof. See [.> .], example 25.3 or [49], theorem 27.3. 

16.4.40. Proposition. An order ideal in a Dedekind complete Riesz space is itself Dedekind com- 
plete. 

16.4.41. Example. If (S, 21) is a nonempty measurable space, then the Riesz space ca(5') is 
Dedekind complete. 

16.4.42. Proposition. Let v be an element in a Riesz space E. The principal order ideal Jy 
generated by v is given by 

Jv = {x £ E : \x\ < a\v\for some a > 0}. 

16.4.43. Proposition. Let v be an element in a Riesz space E. The principal band By generated 
by V is given by 

By = {x e E: \x\ A n\v\ t 

(where n 

16.5. Disjointness in Riesz Spaces 

16.5.1. Definition. In a Riesz space two elements x and y are disjoint (or orthogonal, or 
MUTUALLY SINGULAR) if |a;| A \y\ = 0. In this case we write x ±y. 

16.5.2. Proposition. Let x, y, and z be elements of a Riesz space. 

(a) If y -L z and \x\ < \y\, then x _L z. 

(b) If X ± y and a G M, then ax _L y. 

16.5.3. Proposition. Let E be a Riesz space. 

(a) If X, y, z G E, then 

\{x V z) - {y V z)\ + \ {x A z) - {y A z)\ = \x - y \ . 

(b) If u, V, w G E'^ , then 

{u + v) Aw < {u A w) + {v A w) . 

(c) // u, V, w G E^ and u J- v, then 

{u + v) A w = {u A w) + {v A w) . 

(d) If X J- z and y -L z in E, then x + y -L z. 

16.5.4. Proposition. Let E be a Riesz space. 

(a) X J- y in E if and only if x~^ -L y and x~ _L y. 

(b) Let D Q E and suppose that H. = \J D exists (in E). Then 

i+ = \J{x+: X G D} and = /\{x- : x G D} . 

(c) Let D E and suppose that i = \/ D exists (in E). If y ± x for all x G D, then y -L £. 

16.5.5. Exercise. Let S be a Riesz space. 

(a) If x -L y in E, then 

1 1^1 ~ |y| I = 1^ + y| = 1^ ~ 2/1 = l^^l + \y\ = \x\ v \y\ . 

(b) If |x + y| = |x — y\ in E, then x _L y. 

(c) If Ix + y| = |x| V \y\ in E, then x -L y. 

(d) If \x — y\ = \x\ V |y| in E, then x _L y. 
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16.5.6. Definition. Let A be a nonempty subset of a Riesz space E. The DISJOINT complement 
of A, denoted by A'^, is the set of all elements of E that are disjoint from every member of A. That 
is, 

A'^ := {x €z E : x _L a for every a G A} . 

16.5.7. Proposition. Let A C B Q E where E is a Riesz space. Then 

(a) AnA'^C {0}, 

(b) B'^ C A'^, 

(c) ACA'^'^, and 

(d) A'^ = A'^^'^. 

16.5.8. Example. If ^ is a subset of a Riesz space E, then A"^ is a band in E. 

16.5.9. Proposition. If Ji and J2 are order ideals in a Riesz space E such that E = Ji© J2, then 
J2 = Ji^ and Ji = ■ Thus Ji and J2 are bands. 

16.5.10. Definition. Two real measures /j, and on a measurable space (S, 21) are mutually 
SINGULAR if they are disjoint in the Riesz space ca(S'); that is, if |//| A = 0. We extend 
this terminology to complex and positive measures: two such measures are mutually singular if 

A 1 1^1 = 0. If ;U and v are mutually singular measures we write /i _L i/. It is possible to give a 
very simple necessary and sufficient condition for the mutual singularity of measures. 

16.5.11. Proposition. If fi and v are real measures on a measurable space (5,21), then fi J- v if 
and only if there exists a measurable subset D of S such that 

|/.|(D) = IH(i?^) = o. 

Hint for proof It is enough to consider positive real measures. Suppose fi A u = 0. Then by 
example 16.1.10 it is possible to choose a measurable set A so that ^{A) + i'{A^) is arbitrarily small. 

Use this fact to show that for each e > there exists B (Z S such that fJ.{B) = and v{B'^) < e. 
Use this in turn to find a set D such that fJ-{D) = and v{D'^) = 0. 

m 

16.5.12. Definition. Let be a real measure on a measure space S and P ^ S. We say that P 
is a POSITIVE SET for /i if ^{A) > for every measurable subset A of P. Similarly, if ^{A) < for 

m 

every A O P, then P is a negative set for fj,. 

If P is a positive set for the real measure /i and if P'^ is a negative set for fi, then the pair 
(P, P'^) is a Hahn decomposition of S. If a Hahn decomposition of a real measure space (S, fi) 
is known, then it becomes a very simple matter to compute the Jordan decomposition of fj,. To 
determine the action of on a measurable set A, intersect A with P and apply n ; to find iJ,~{A), 
apply -fi to An P". 

16.5.13. Proposition. If{P,P'^) is a Hahn decomposition of a real measure space (5*, /x), then for 

m 

every A C S" 

fi+{A) = ^,{AnP) 

and 

^i-{A) = -^{AnP'). 

Notice the following very simple corollary: If (P, P"^) is a Hahn decomposition of a real measure 
space, then ^^[B) = whenever B Q P and ^'^{B) = whenever B C P'^. 

16.5.14. Example. Let (S", 21, /i) be a positive real measure space such that /x(5) = 1 and let 
u = ^ — 5h where 5h is Dirac measure on (S, 21) concentrated at some point b £ S. 

(a) If P = 5 \ {6}, then the pair (P, P^) is a Hahn decomposition of S induced by v. 

(b) The Jordan decomposition = — is given by i'~^{A) = ^{A \ {b} ) and i^~iA) = 
1 - iJ,i{b}) if 6 G ^ and i^'iA) = otherwise. 
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(c) Suppose further that c is a member of S different from b such that /i({c}) = 0. Then 
{Q, Q^) is a second Hahn decomposition of S when = {6, c}. 

As the previous example indicates, Hahn decompositions are not, in general, unique. However, 
they are "almost" unique. 

16.5.15. Proposition. // {P,P^) and {Q,Q'^) are Hahn decompositions of a real measure space 
{S,fj,), then 

|^|(PAQ) = 0. 

In light of the convenience provided by a Hahn decomposition in computing /i"'" and fi~, it is 
gratifying to know that such a decomposition always exists. 

16.5.16. Proposition. Every real measure space has a Hahn decomposition. 

16.5.17. Definition. If ^ is a real measure on a measurable space S, then the corresponding 
positive real measure |/i| = /i"*" + ii~ is called the total variation of /i. As is true of and fi~ , 
it is possible to give a simple characterization of the total variation measure. 

16.5.18. Proposition. If n is a real measure on a measurable space S, then for each measurable 
subset A of S 

\p.\{A) = ^nv\Y,UBk)\:A=\^BA. 

^k=l k=l ^ 

Hint for proof. The inequality in one direction can be obtained by showing that: 

\^\{A) = HAnP)\ + \fi{AnP')\ 

m 

where A <^ S and (P, P^) is a Hahn decomposition of S. 

16.5.19. Example. Let 21 be an algebra of subsets of a set S. For /u G ba(5, 21)) define 

M\ = \fi\{S) (= sup{|H(A):ylG2l}). 

This makes ba(S') into a real normed linear space. If 21 is a fi-algebra of subsets of S, then ca(5, 21) 
is also a real normed linear space. Notice that this is not the same norm given in example 12.1.18. 

16.6. Absolute Continuity 

16.6.1. Definition. If J is any interval in M a real (or complex) valued function / on J is abso- 
lutely CONTINUOUS if it satisfies the following condition: 

for every e > there exists 5 > such that if (ai, 6i), . . . , (a„, ?)n) are dis- 
joint nonempty subintervals of J with J2k=ii^k — cik) < then X^fc=il/(^fc) — 
f{ak)\ < e. 

16.6.2. Example. The function x i— )■ ^/x is absolutely continuous on [0,oo). 

It is obvious that an absolutely continuous function on an interval in M is uniformly continuous. 
But the converse is not true. 

16.6.3. Example. The function /: [0,1] — ?• M defined by f{x) = j;sinQ) for < x < 1 and 
/(O) = is uniformly continuous but not absolutely continuous. 

Hint for proof. The easiest way to see that / is uniformly continuous is to make use of two 
results we have not yet proved, 17.1.29 and 17.1.21. 

16.6.4. Definition. If fi and z/ are real measures on a measurable space, we say that n is abso- 
lutely CONTINUOUS WITH RESPECT TO z/, and write /X ^ z/, if |z^|(A) = implies fi{A) = 0. 

16.6.5. Proposition. // /i and v are real measures on a measurable space S, then the following 
are equivalent: 
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(a) jJ. <C I'; 

(b) <C and /x~ <C and 

(c) \^^\ < 

16.6.6. Proposition. Let ji, v, and p be real measures on a measurable space S. 

(a) iJ, <^ jjL. 

(b) If p <^ u and v <^ p, then p <^ p. 

(c) If < < p, then v <^ p. 

(d) If p<^0, then p = 0. 

(e) If p <^ p and v <^ p, then p + ly <^ p. 

(e) If p-if > 0, then u ^ p + u. 

(f) If u <^ p and p -\- p, then _L p. 

The next result may help explain the connection between absolute continuity of functions and 
absolute continuity of measures. 

16.6.7. Proposition. Suppose that v is a measure and p is a complex measure on a measurable 
space S. Then the following are equivalent: 

(a) p <^ u; and 

m 

(b) For every e > there exists 5 > such that \p{A)\ < e whenever ACS and i'{A) < 6. 

16.6.8. Proposition. Let p and v be positive real measures on a measurable space S. Then the 

following are equivalent: 

(a) p <C I'; 

(b) if (An) is a sequence of measurable subsets of S such that viAn) — >■ as n — >■ oo, then 

p{An) — T- as n — t- oo; and 

m 

(c) for every e > there exists 6 > such that p{A) < e whenever ACS and i^{A) < 6. 
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COMPACT SPACES 

17.1. Compactness 

17.1.1. Definition. Let iX be a family of sets and 5 be a set. Recall that if — then we 
say that il covers S, or that ii is a cover for S, or that 11 is a covering of S. If a cover il for a 
topological space X consists entirely of open subsets of M, then iX is an open cover for X. If il 
is a family of sets which covers a set S and 03 is a subfamily of il which also covers S, then QJ is 
a SUBCOVER of il for S. A topological space X is compact if every open cover of X has a finite 
subcover. 

We have just defined what we mean by a compact space. It will be convenient also to speak of 
a compact subset of a topological space X. If X C X, we say that i^T is a compact subset of X 
if, regarded as a subspace of X, it is a compact topological space. 

Suppose we wish to show that some particular subset K of a topological space X is compact. Is 
it necessary that we work with coverings made up of open subsets of K (as the definition demands) 
or can we just as well use coverings whose members are open subsets of X? The answer is that, 
happily, either will do. In other words compactness is an intrinsic property of a set. (Recall the 
cautionary note at the beginning of section 11.5.1.) 

17.1.2. Proposition. A subset K of a topological space X is compact if and only if every cover of 
K by open subsets of X has a finite subcover. 

The class of compact Hausdorff spaces together with the continuous maps between them con- 
stitute an important category. We denote this category by CpH. 

Compactness in metric spaces can be characterized in terms sequences. A metric space is said 
to be sequentially compact if every sequence in the space has a convergent subsequence. 

17.1.3. Proposition. A metric space is compact if and only if it is sequentially compact. 
Proof. See [17], theorem 16.2.1. 

17.1.4. Proposition. In a metric space every compact set is bounded. 

17.1.5. Example. Every finite subset of a topological space is compact. 

17.1.6. Example. Let x be a convergent sequence in a topological space X and a = limx,i. Then 
ranx U {a} is a compact subset of X. 

An important fact about compactness is that it is F-hereditary. 

17.1.7. Proposition. Every closed subset of a compact topological space is compact. 

17.1.8. Proposition. In a Hausdorff topological space every compact set is closed. 

17.1.9. Example. In the preceding proposition the hypothesis that the space be Hausdorff is 
necessary. 

17.1.10. Definition. A family J" of sets is said to have the finite intersection property if 
every finite subfamily of J- has nonempty intersection. 

17.1.11. Proposition. A topological space X is compact if and only if every family of closed 
subsets of X with the finite intersection property has nonempty intersection. 
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17.1.12. Proposition. Let (i^^) be a decreasing sequence of nonempty closed subsets of a compact 
topological space. Then HfeLi nonempty. 

The next result is fundamental: the continuous image of a compact set is compact. 

17.1.13. Proposition. If f : X ^ Y is a continuous function between topological spaces and X is 
compact, then ran / is compact. 

17.1.14. Proposition. In the category CpH every epimorphism is a quotient map. 
Compare the preceding proposition with example 11.6.6. 

17.1.15. Proposition. Every continuous bijection from a compact space to a HausdorfJ space is 
a homeomorphism. 

Recall that we have seen in example 11.1.10 that in the category TOP bijective morphisms 
need not be homeomorphisms. However . . . 

17.1.16. Corollary. In the category CpH bijective morphisms are isomorphisms. 

The preceding corollary and proposition 17.1.14 help to explain analysts' affection for compact 
Hausdorff spaces. 

17.1.17. Example, let X = [0, 2tt] and let ~ be the equivalence relation which identifies and 2ir. 
(That is, the only distinct elements which are equivalent are and 27r.) Show that the resulting 
quotient space X/~ (see exercise 11.6.4) is (homeomorphic to) the unit circle T. 

17.1.18. Theorem (Extreme value theorem). Every continuous real valued function on a compact 
topological space assumes a maximum and a minimum value somewhere on the space. 

17.1.19. Theorem (Dini's Theorem). If a decreasing sequence (fn) of real valued functions on an 
interval [a, b] in M converges pointwise to zero, then it converges uniformly to zero. 

Hint for proof. Given e > let An ■= /n~"'^([e, oo)). Argue that since the family {An', n G N} 
has empty intersection, it cannot have the finite intersection property. 

17.1.20. Exercise. Generalize Dini's theorem to real valued functions on a compact Hausdorff 
space X. Also see if you can weaken the hypotheses: instead of assuming that a decreasing sequence 
(/n) of continuous functions converges pointwise to zero, assume instead that is a family of 
continuous functions such that 

(i) the infimum of any two members of J- dominates some third member of J- and 

(ii) the function g: x ^ inf{/(2;) : / £ J^} (where x G X) is real valued and continuous. 

17.1.21. Proposition. Every continuous function between metric spaces whose domain is compact 
is uniformly continuous. 

17.1.22. Exercise. Show that the interval [0,oo) is not compact using each of the following: 

(a) the definition of compactness (see 17.1.1); 

(b) the extreme value theorem (see 17.1.18); 

(c) proposition 17.1.21; 

(d) the finite intersection property (see 17.1.11); 

(e) Dini's theorem (see!l7.1.19). 

17.1.23. Exercise. Using only the definition of compactness prove that the closed unit interval 
[0, 1] is compact. 

17.1.24. Exercise. Let A, B <^ M where {M,d) is a metric space. Prove or disprove: 

(a) If A and B are disjoint closed sets, then d{A, B) > 0. 

(b) If A and B are disjoint sets, if A is compact, and if B is closed, then d{A, B) > 0. 

(c) If A and B are disjoint compact sets, then d{A, B) > 0. 
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(d) If A and B are closed sets, then there exist points a £ A and b £ B such that d{A, B) = 
d{a,b). 

(e) If A is compact and B is closed, then there exist points a £ A and b £ B such that 
d{A,B) = d{a,b). 

(f ) If A and B are compact sets, then there exist points a £ A and b £ B such that d{ A, B) = 
d{a,b). 

(Recall that in a metric space the distance d{A, B) between two sets A and B is defined to be 
inf{d(a, b): a£ A and b £ B}. 

17.1.25. Proposition. Let Ti and T2 &e topologies on a set X with Ti weaker than T2. // the 
space (X, Ti) is HausdorjJ and the space (X, T2) is compact, then the two topologies are equal. 

17.1.26. Exercise. In each of the following sentences, fill in the blank with either the word 
"weakened" or the word "strengthened" to make a correct assertion. 

(a) A Hausdorff space remains Hausdorff if its topology is . 

(b) A continuous function f : X ^ Y remains continuous if the topology on X is . 

(c) A continuous function f : X ^ Y remains continuous if the topology on Y is . 

(d) A convergent sequence in a topological space remains convergent if the topology on the 
space is . 

(e) A compact space remains compact if its topology is . 

17.1.27. Theorem (Tychonoff). Let {X^: a £ A} be a family of topological spaces. Their product 
Y\ Xa is compact if and only if each factor space X^ is compact. 

Tychonoff proved this theorem in 1930 for the special case where each X^ is the unit interval; 
Cech published the proof of the general case in 1937. 

Proofs of Tychonoff 's theorem highlight the important distinction between "elementary proofs" 
and "easy proofs" . The short and easy proofs of this theorem all make use of somewhat advanced 
concepts: Aliprantis and Border [ '] (theorem 2.57), Dugundji [1 ^] (theorem XI. 1.4(4)), and Wilan- 
sky [47] (theorem7.4.1) employ the machinery of ultrafilters, while Willard [-48] (theorem 17.8) 
uses universal nets. An elementary (but longer) proof appears in Lang [28] (chapter 2, theorem 
3.12). Perhaps the most approachable compromise is the route taken by Rudin [38] (theorem A. 3) 
and Hewitt and Stromberg [23] (theorem 6.43) where the principal tool is the Alexander subbase 
theorem, which says that a topological space is compact if and only if every cover by subbasic open 
sets has a finite subcover. Whatever path you choose, somewhere you will need some version of the 
axiom of choice. (Kelley proved in 1950 that Tychonoff 's theorem implies the axiom of choice.) 

17.1.28. Proposition. If K and L are compact subsets of a normed linear space, then K + L is 
compact. 

17.1.29. Theorem (Heine-Borel theorem). A subset ofW^ is compact if and only if it is closed 
and bounded. 

17.1.30. Proposition. Any two norms on are equivalent. 

17.1.31. Example (Cantor set). Define a subset C of the unit interval [0,1], called the Cantor 
set, as follows. Let Fi be [0, 1] with its open middle third (|, |) removed. So Fi consists of two 
closed intervals. Let F2 be -Fi with the open middle thirds of both intervals removed. Proceed 
inductively. Having defined the set Fn, which consists of 2" closed intervals, let -F^+i be Fn with 
the open middle third of each of its constituent intervals removed. Finally, let C — C\'^_-^Fn. Then 



U=i J 

(b) C consists of all points of [0, 1] which have at least one ternary expansion not containing 

the digit 1: 

(c) C is a compact metric space; 
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(d) C is uncountable; and 

(e) C is homeomorphic to the product space {0, 1}^ (where {0, 1} is a two-point space with 
the discrete topology). 

Hint for proof. Suppose that x is a number in the interval [0, 1] and x = X^^^ af^3~^, where 
each Ofc is 0, 1, or 2. Then we may adopt a shorthand notation x = 0.201020304 • • • • This is the 
TERNARY EXPANSION of X. As is the case with decimal expansions, ternary expansions need not 
be unique. For example, | = 0.^1000 • • • = 0.^0222 .... For part (d) consider the function from the 

Cantor set C into [0, 1] whose value at a point YlT=i '^k'^~^ (each Ofc being or 2) is X^fcli flfc2^^'^^^^ 
We will call this function the Cantor function. This function is surjective; but be careful — it is 
not quite one-to-one; its values at the endpoints of each interval removed in the construction of the 
Cantor set are equal. (You can find careful discussions of the elementary properties of the Cantor 
set in pages 26-29 of [10] and in section 2.5 of [32].) 

17.1.32. CAUTION. Many authors assign the name Cantor function to the extension of the 
function defined above to the entire interval [0, 1]. (This is accomplished by assigning the function 
a constant value on each of the open intervals removed in the construction of the Cantor set in 
such a way that the resulting function is increasing. See example 22.5.1 for details.) We will call 
this extension the Lebesgue singular function. 

17.1.33. Definition. An isolated point of a subset yl of a topological space X is a point 
belonging to A which is not an accumulation point of A. A closed subset of X is said to be 
PERFECT if it contains no isolated points; that is, if ^ = ^4'. 

17.1.34. Example. At each stage in the construction of the Cantor set we removed the middle 
third of each remaining closed interval. The fraction one-third is a bit arbitrary. We may if we 
wish pursue essentially the same procedure but (at each stage) remove an open interval of some 
other length from the center of each remaining closed interval. This will result in sets known as 
Cantor-like sets. A detailed description of this process, together with a proof that such a set 
is compact, perfect and has empty interior, can be found in [23], pages 70-71. 

In this context two more very interesting results are proved in [23]. The first generalizes the 
fact (see example 17.1.31(d)) that the Cantor set is uncountable. 

17.1.35. Proposition. If A is a nonempty perfect subset of a complete metric space, then card^ > 
cardM. 

Proof. See [23], theorem 6.65. 

The second result says that closed subsets of second countable spaces are "almost" perfect. 

17.1.36. Proposition (Cantor-Bendixson). If A is a closed subset of a second countable topological 
space X , then there exists a perfect subset P of X contained in A such that A\P is countable. 

Proof. See [23], theorem 6.66. 

17.2. Local Compactness 

17.2.1. Definition. A topological space is locally compact if every point in the space has a 
neighborhood whose closure is compact. 

17.2.2. Example. Every compact space is locally compact. 

17.2.3. Example. For each n G N Euclidean n-space M" is locally compact but not compact. 

17.2.4. Example. The space Q of rational numbers is not locally compact. 

17.2.5. Notation. Let A and B be subsets of a topological space. We write A ~< B \i ACL B° . 

17.2.6. Proposition. The following hold for subsets A, B, and C of a topological space: 
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(a) if A B and B C, then A -< C; 

(b) if A-<C and B C, then Au B C; and 

(c) if A<B and A<C, then A-< Br\C. 

Starting in section 21.4 we study so-called normal topological spaces. The class of all such 
spaces is important because it contains all metric spaces and all compact Hausdorff spaces. One 
very important result for these spaces is Urysohn's lemma concerning the separation of a pair of 
disjoint closed sets by a continuous function (see theorem 21.5.9). Unfortunately not every locally 
compact Hausdorff space is normal. The next three results 17.2.7, 17.2.8, and 17.2.10 give us a 
version of Urysohn's lemma for locally compact Hausdorff spaces. 

It is not easy to dream up an example of a locally compact Hausdorff space which is not normal. 
There is, however, a standard example, the Tychonoff plank. You can find details in [ ], page 145, 
example 4 and page 239, example 2 or [47], section 14.6. 

17.2.7. Proposition. Let X be a locally compact Hausdorff space, x G X , and V be a neighborhood 
of X. Then there exists a neighborhood U of x with compact closure such that U ~<V. 

Proof. See [23], theorem 6.78 or [ ], proposition 7.1.2. There is also a proof in [ ~], propo- 
sition 7.22(a), but it makes use of facts about normal topological spaces. 

17.2.8. Proposition. Let X be a locally compact Hausdorff space, K a compact subset of X, and 
V be an open subset of X such that K C V .Then there exists an open set U in X with compact 
closure such that K <U <V . 

Proof. See [■^"'], theorem 6.79; [ '], proposition 7.1.3; or [ ], proposition 7.22(b). 

17.2.9. Definition. We will say that disjoint sets A and B in topological space X can be func- 
tionally SEPARATED if there exists a continuous function / : X — ^ [0, 1] such that f^{A) = {0} 
and f-'iB) = {1}. 

17.2.10. Theorem (Urysohn's lemma for locally compact Hausdorff spaces). Let X be a locally 
compact Hausdorff space, K be a compact subset of X, and C be a closed subset of X . If K and C 
are disjoint, then they can be functionally separated. 

Proof. See [-.->], theorem 6.80. Again there is a proof in [ ], theorem 7.14, but it makes use 
of facts about normal spaces. 

17.2.11. Definition. We say that a family T of continuous functions on a topological space X 
SEPARATES POINTS of X (or is a SEPARATING FAMILY of functions on X) if for every pair of distinct 
points X, y £ X there exists f £ T such that f(x) ^ f{y)- (Notice that there is a slight (and 
unimportant) difference between separating points x and y and separating the sets {x} and {y}.) 

17.2.12. Proposition. Let X be a compact topological space. Then C{X) is separating if and only 
if X is Hausdorff. 

17.2.13. Definition. A topological space {X, T) is metrizable if its topology T is the one induced 
by some metric on the set X. (See remark 10.4.14.) 

17.2.14. Proposition. Every second countable locally compact Hausdorff space is metrizable. 
Proof. See [32], theorem 7.16. 

17.3. Compactifications 

17.3.1. Definition. Let X and Y be topological spaces. A homeomorphism f : X ^ Y from X 
onto a subspace of y is a (topological) embedding and we say that X is embedded in Y. 
A COMPACTIFICATION of a Hausdorff topological space X is an ordered pair (K, cp) where is a 
compact Hausdorff space and (j) is an embedding of X into K. 
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17.3.2. CAUTION. This definition is not entirely standard. Many authors require that for {K, (p) 
to be a compactification of X the range of (j) must be dense in K. 

17.3.3. Proposition. Every locally compact HausdorjJ space X has a compactification X such that 
X \ X consists of a single point. 

Proof. Let X := X L) {oo} where oo is any point which does not belong to X. Declare the 
open subsets of X to be the open subsets of X together with the complements in X of compact 
subsets of X. It is easy to show that this family of subsets does indeed comprise a topology on X 
and that with this topology X is a compact Hausdorff space. □ 

The space X defined above is the one-point compactification (or the Alexandroff com- 
pactification) of X. The single point oo which we have adjoined to X is often called, somewhat 
fancifully, the point at infinity. There is nothing particularly mysterious or infinite about this 
point. It is, as we have said, just any point not in X. Can we be sure that there is such a point? 
Certainly. We can use the set X itself. Wilansky in [47] repeats the perhaps apocryphal story 
that some mathematician once told his students in response to this question, "/ am the point at 
infinity." 

17.3.4. Definition. There is some disagreement about terminology here. If one defines compactifi- 
cation in a way that requires the original space to be dense in its compactification (see 17.3.2), then 
it seems inappropriate to call X a compactification of X when X is compact to begin with. For 
in this case the point oo is clearly an isolated point of X so that X is not dense in X. Thus some 
writers restrict the terminology of one-point compactification to spaces X which are noncompact 
locally compact Hausdorff spaces. 

17.3.5. Example. The interval [0, 1] is the one-point compactification of (0, 1]. 

17.3.6. Example. The unit circle is the one-point compactification of M. More generally, the 
n-sphere S" is the one-point compactification of M" . 

17.3.7. Definition. Here is a list of some of the separation axioms which may be enjoyed by a 
topological space X. 

(i) X is a Ti space if for every pair x and y of distinct points in X there exist a neighborhood 
of X which does not contain y and a neighborhood of y which does not contain x. 

(ii) X is a r2 space if it is Hausdorff. 

(iii) A is a regular space if closed sets and points not belonging to the sets can be separated 
by open sets. 

(iv) A is a T3 space if it is Ti and regular. 

(v) A is a completely regular space if closed sets and points not belonging to the sets 
can be functionally separated. 

(vi) A is a Tychonoff space (or a T31 space) if it is Ti and completely regular. 

17.3.8. Proposition. A topological space X is Ti if and only if every finite subset of X is closed. 

17.3.9. Proposition. The following implications hold for topological spaces: 

T..1 ^ T3 ^ T2 ^ Ti. 

17.3.10. Proposition. Every locally compact Hausdorff space is a Tychonoff space. 
Proof. See [..:o], theorem 19.3. 

17.3.11. Theorem (Stone-Cech compactification). Let X be a Tychonoff topological space. Then 
there exists a unique (up to homeomorphism) compactification /3(A) of X which satisfies the fol- 
lowing condition: if K is a compact Hausdorff space, then every continuous function f : X ^ K 
has a continuous extension to /3(A). 
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Proof. See [1], section 2.17; [ ■ theorem 8.2; [34], proposition 4.3.18; or [ ' theorem 19.12. 

17.3.12. Remark. To appreciate how remarkable the Stone-Cech compactification P{X) is, con- 
sider the problem of extending the function 

/: (0,1] ^M: x^sin(^) 

to a continuous function on some compact set in which (0, 1] is dense. 
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LEBESGUE MEASURE 

18.1. Positive Measures 

Many authors refer to positive measures simply as measures. In these notes the word "measure" 
encompasses real measures, positive measures, complex measures, and signed measures, although 
little will be said about the last of these. Many facts about positive measures are quite similar to the 
facts about real measures presented in chapter 16, but often an additional hypothesis concerning 
the finiteness of the measure of some set is required. 

First we observe that positive measures are monotone. Real measures, complex measures, and 
signed measures need not be monotone. 

m 

18.1.1. Proposition. Let ^ he a positive measure on a set S . If A, B S and A (1 B, then 
fi{A)<is{B). 

The next proposition says that positive measures are countably subadditive. 

18.1.2. Proposition. If fi is a positive measure on a measurable space S and if Ai, A2, . . . are 

measurable subsets of S, then 

(00 \ 00 
[JaA < J^M^fe). 
k=l ^ k=l 

m 

18.1.3. Proposition. Let be a positive measure on a measurable space S. If A, B CI S, if 

AC B, and if fj.{A) < 00, then fi{B \A) = fj.{B) - fi{A). 

(Compare this with proposition 9.4.4.) 

771 

18.1.4. Proposition. Let ji be a positive measure on a measurable space S. If A, B C S, and if 
fi{A nB) <oo, then 

fi{A \JB) = fi{A) + fi{B) - fi{A n B) . 
(Compare this with proposition 9.4.3.) 

18.1.5. Proposition. Let S be a measurable space, (An) be a sequence of measurable subsets of S, 
and ^ be a positive measure on S. 

(a) If An t B, then ^(^„) ^ ^i{B). 

(b) If An iB and if ^{Ai) < 00, then fi{An) — )• IJ-{B). 

(Compare this with proposition 16.1.5.) 

18.1.6. Example. Let 5 be a nonempty set and 2t = *P(S'). Foi A C S let fi{A) = card A if A 
is finite and fi{A) = 00 if j4 is infinite. Then {S, 21, fi) is a positive measure space. The positive 
measure fi is called counting measure on S. 

18.1.7. Example. Give an example to show that the second hypothesis in part (b) of proposi- 
tion 18.1.5 is necessary. 

18.1.8. Example. Let S be an uncountable set and 21 = {A C S: A or A^ is countable}. For 
^4 G 21 let IJ,{A) = if j4 is countable and fJ,{A) = 1 if A is uncountable. Then (5, 21, jj.) is a positive 
(real) measure space. 
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18.1.9. Proposition. Suppose that {An) is a sequence of measurable subsets of a positive measure 
space {S, fi). Then 

(a) //(liminf yl„) < lira, ini fi{An). 

(b) If iJ,{U^^iAn) < oo, then /i(limsup An) > limsup fi{ An). 

(c) If fJ,{L)'^^iAn) < oo and lira. An exists, then lim ^(An) exists and fi(limAn) = lim/i(A„). 

18.1.10. Definition. If (5, 21, /i) is any type of measure space we say that some property P{x) of 
an element x in 5 holds almost everywhere or that property P holds for almost all x £ S if 



We use the abbreviation ^-a.e. (or just a.e.) for almost everywhere. Thus, for example, f = g a.e. 
means that /(x) = g{x) for almost all x (in the measure space 5) and for a sequence (/„) of functions 
on S, the notation fn ^ 9 (a.e.) means that fn{x) — ?• g{x) for almost all x in S. Whenever we 
claim that something happens almost everywhere on a subset of M" without specifying a measure, 
Lebesgue measure is understood. 

18.1.11. Proposition (The Borel-Cantelli lemma). Let {An) be a sequence of measurable subsets 
of a positive measure space {S, //) such that X^fcLi /"(^fc) < oo. Then almost all x lie in only finitely 
many of the sets A^. 

18.1.12. Definition. A point a in a positive measure space {X,fj,) is an atom of X if fi{{a}) > 0. 

18.1.13. Proposition. Let {X,fj,) be a positive real measure space and A be the set of atoms of X. 
Then there exist positive real measures Hd and /ic on X such that {X, ^d) is a discrete measure 
space, {X,fj,c) is a measure space with no atoms, and fj, = fi^ + f^c- Show also that n has at most 
countably many atoms and that 



(where 6a is Dirac measure concentrated at a). 

The measure fid is the discrete part of fi and fic is the CONTINUOUS PART of fi. 

18.1.14. Definition. A positive measure space {S,fi) (or its measure fi) is said to be a-FlNlTE if 
S is the disjoint union of a countable collection of measurable subsets each having finite measure. 

18.1.15. Exercise. Show that proposition 18.1.13 can be generalized to c-finite positive measures. 



Positive measures are designed to be generalizations of such things as length on the real line, 
area in the plane, and volume in 3-space. Ideally we would like to have on M, for example, a positive 
measure on *p(M) which is translation invariant (see definition 18.3.6) and has the property that the 
measure of the unit interval is 1. Unfortunately this cannot be achieved (see proposition 18.3.12). 
We can scarcely give up positivity, countable additivity, translation invariance, or the correct 
measure of interval length; so we have to settle for a positive measure defined on some proper 
subset of the power set *p(M). Lebesgue measure on M is the best we can do. It is defined on a 
quite large subset of *13(]R), a set in fact so large that finding a set not in the domain of Lebesgue 
measure requires the use of the axiom of choice. The price we pay for this success is the appearance 
of a number of technical details, which, depending on one's proclivities, are either fascinating or 
annoying. The next two sections discuss the construction of Lebesgue measure. 

In the current section we give one answer to the rather general question: where do positive 
measures come from? Briefly stated (in terms to be defined shortly): start with a gauge i on a 
sequential covering class of subsets of a set S; this produces an outer measure mi on the power set 
*P(5), which in turn produces a positive measure on the cr-algebra of mi-measurable subsets of S. 

18.2.1. Definition. A family £ of subsets of a set S is a sequential covering CLASS for S if 



fi{{x G S: P{x) does not hold}) = 0. 
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(i) G e: and 

(ii) S = UfcLi C'/fc for some sequence (C^) of sets in C 

18.2.2. Example. The family of bounded open intervals in M is a sequential covering class for R. 

18.2.3. Definition. Let (t he a sequential covering class for a set S. A function £: C ^ [0, oo] 
such that £(0) = is a gauge on S. (This is clearly an abuse of language: we should say that i is 
a gauge on £). 

An outer measure is a positive, increasing, countably subadditive function on the power set of 
a set whose value at is 0. 

18.2.4. Definition. Let S be a nonempty set. A function m : ^{S) — >■ [0, oo] is an OUTER MEASURE 
on S if: 

(i) m(0) = 0; 

(ii) m{A) < m{B) whenever A ^ B C. S; and 

(iii) m (Ufcli ^k) < SfcLi "i(^fc) whenever [Ak) is a sequence of subsets of S. 

(Saying that m is an outer measure on S is again a standard abuse of language: we should say that 
it is an outer measure on ^{S).) 

18.2.5. Example. Let C be a sequential covering class of a set S and ^: £ — >■ [0, oo] be a gauge 
on S. For every A(Z S define 

mi{A) := mil ^ ^(C^) : (C^) is a sequence in £ which covers A. > 
^k=l ^ 

(For convenience we will write m{A) for m^^A) when no confusion seems likely.) Then m is an 
outer measure on S. 

18.2.6. Example. The preceding example says that any gauge on an arbitrary sequential covering 
class of a set S generates an outer measure on S. It does not say that it generates an interesting 
outer measure. Let S = M. and £ be the sequential covering class consisting of all bounded open 
intervals in M. If a < 6 define i{{a, b)) = {b — a)~^ and let £(0) = 0. Describe the outer measure 
generated by the gauge i. 

18.2.7. Definition. Let m be an outer measure on a set S. A subset E ot S is m-MEASURABLE 

(or MEASURABLE WITH RESPECT TO THE OUTER MEASURE m) if 

m{T) = m{T n E) + m{T n E") 

for every T C 5. 

18.2.8. Exercise. Let £ be the family of open intervals in R of length 1 together with 0. Let 
^(0) = and £{C) = 1 if C G £ and C ^ $. Then £ is a sequential covering class and £ is a gauge 
on £. Denote by m the outer measure induced on *P(M) by i. 

(a) Find m({0}). 

(b) Find m((0, 1)). 

(c) Find m([0, 1]). 

(d) Which subsets of M are m-measurable? 

(e) What measure /x^ is induced by m on the cr-algebra of m-measurable subsets of M? 

The preceding exercise shows that the outer measure induced on a sequential covering class by 
a gauge may not be an extension of the gauge. What do wc have to require of a gauge i so that the 
outer measure m it induces will be an extension of £? Since m is countably subadditive whether 
or not £ is, it is clear that in order for m to be an extension of £ it is necessary that £ itself be 
subadditive. This is also sufficient. 
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18.2.9. Proposition. Let i be a gauge on a sequential covering class C of a nonempty set S and m 
be the outer measure induced by £. Then m{C) = 1{C) for every C £ ^ if and only if £ is countably 
subadditive (in the sense that £{C) < whenever C G C and (Cfc) is a sequence in (t 
which covers C ). 

18.2.10. Example. Arguably the most important example of a gauge is length 



defined on the sequential covering class of bounded open intervals in M. The outer measure m 
generated by this gauge is called Lebesgue outer measure. When m is Lebesgue outer measure 
on M, an m-measurable subset of M is a Lebesgue measurable set. 

18.2.11. Remark. Notice that in order to show that a set E is m-measurable it is necessary only 
to verify that m(T) > m(T (1 E) + m{T \ E) for each subset T of S, since the reverse inequality 
holds by virtue of the countable subadditivity of m. 

18.2.12. Proposition. Let m be an outer measure on a set S. A subset A of S is m-measurable 
if and only if for every e > there exists an m-measurable set E Q A such that m{A \ E) < e. 

18.2.13. Proposition. Let m be an outer measure on a set S. A subset A of S is m-measurable 
if and only if for every e > there exists an m-measurable set E such that m{A A E) < e. 

18.2.14. Definition. Let m be an outer measure on a set S. A subset ^ of S" is a null set 
with respect to the outer measure m if m{A) = 0. For clarity such a set is often referred to as an 
m-NULL SET to avoid confusion with the empty set. 

18.2.15. Proposition. // m is an outer measure on a set S, then every m-null subset of S is 
m-measurable. 

18.2.16. Proposition. Lf m is an outer measure on a set S, then the union of a countable family 
of m-null sets in S is itself an m-null set. 

18.2.17. Proposition. Let m be an outer measure on a set S and A <^ S. If Ei, . . . , En are 

disjoint m-measurable subsets of S, then 



18.2.18. Theorem (Caratheodory). Let m be an outer measure on a set S. Then the family 

of all m-measurable subsets of S is a a-algebra of subsets of S. Thus {S, is a measurable space. 
Furthermore, the restriction fim of m to the family Tim is a positive measure, so {S,Tlm, fJ-m) is a 
positive measure space. 

Proof. The proof is standard, but somewhat complicated. It is easy to find. Some sources 
are [1], Theorem 9.19; [18], Theorem 1.11; [28], chapter 11, lemma 7.3; [32], propositions 4.15 and 
4.16; and [ " ' ], chapter 12, theorem 1. 

18.2.19. Proposition. // an outer measure is finitely additive, then it must be countably additive. 

18.2.20. Proposition. Let A be a subset of M with finite Lebesgue outer measure m. Define 
F{x) = m(^A n (— oo, x]) for every G M. Then F is uniformly continuous. 



In this section we give some of the details concerning Lebesgue measure on the real line. Many 
authors carry out the details in the more general setting of Euclidean n-space. See for example [2], 
section 15; [1; ], chapter 2, section 6; [j l], volume 1, section 115; [37], chapter 2, 2.19-2.20; and 
[44] , chapter V. 
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18.3.1. Definition. Example 18.2.10 defines Lebesgue outer measure m = m£ (where £ is tfie 
lengtii function on bounded open intervals in M) and theorem 18.2.18 asserts that the restriction A 
of m to the u-algebra Tlx of Lebesgue measurable subsets of i? is a positive measure. This measure 
is Lebesgue measure on M. In the sequel when a measure is denoted by A it is intended to be 
Lebesgue measure. 

18.3.2. Proposition. Every subset of a measurable set with Lebesgue measure zero is itself mea- 
surable and has Lebesgue measure zero. 

18.3.3. Proposition. The Lebesgue outer measure of any interval in M is its length. 

Hint for proof. Before starting the proof it is good to observe that for a bounded open 
interval (a, 6) the assertion that m((a, 6)) = i(^{a,b)) may not be entirely obvious. Certainly we 
know that m((a, 6)) < l[{a, 5)). But why does the reverse inequality hold? How do we know that 
the interval (a, b) cannot be covered by a family of open intervals whose total length is strictly 
less than 6 — a? One's intuition, of course, rebels at the idea, but in measure theory, as in much 
of mathematics, one's intuition is not an infallible guide. Prove first that the assertion holds for 
closed bounded intervals (using a compactness argument); then prove it for all bounded intervals. 
Finally deal with the case of unbounded intervals. 

18.3.4. Example. The Lebesgue outer measure of the Cantor set is zero. 

18.3.5. Theorem. Every Borel set in M is Lebesgue measurable. 

Thus the collection of Lebesgue measurable subsets of M is quite large indeed. One may well 
wonder if there are any of M which are not Lebesgue measurable. We will see shortly (in exam- 
ples 18.3.9 and 18.3.10) that such sets do exist. The proof will require the axiom of choice. 

Another perfectly reasonable question: is every Lebesgue measurable subset of M a Borel set? 
A cardinality argument, which can be found in [23], remark 10.21, shows that there are in fact 2*^ 
Lebesgue measurable sets which are not Borel sets (where c is the cardinality of the continuum, 
that is, of the set of real numbers). The explicit construction of classes of Lebesgue measurable 
sets strictly containing the Borel sets is undertaken in [39] and [42]. 

18.3.6. Definition. A measure ^ defined on a cj-algebra 21 of subsets of an Abelian group G is 
TRANSLATION INVARIANT if ^ + X G 21 and p,{A + x) = ji{A) whenever ^ G 21 and x £ G. 

18.3.7. Proposition. Lebesgue measure is translation invariant on M (regarded as an Abelian 
group under addition). 

18.3.8. Proposition. Suppose that fi is a measure defined on a a-algebra 21 of subsets of a set S 
and that T is a member of%. Let OS = {^n T: ^4 G 21} and v = . Then (T, *B, v) is a measure 
space. 

The preceding proposition makes it possible to regard any measurable subset of a measure space 
as a measure space in its own right. The measure v in that proposition is the measure induced 
ON T BY |U. (Any restriction of Lebesgue measure A will also be denoted by A.) 

18.3.9. Example. The axiom of choice guarantees the existence of a set A contained in the interval 
(0, 1) which contains exactly one element from each of the equivalence classes which constitute the 
Abelian group M/Q. The set A is not Lebesgue measurable. 

Hint for proof. Show that (0,1) C []{A + r: r G Qn (-1,1)} C (-1,2). 

You can find a leisurely detailed treatment of this nonmeasurable set in [10], pages 289-291. 
It is not terribly difficult to generalize example 18.3.9 from the open unit interval to an arbitrary 
subset of M with strictly positive measure. 

18.3.10. Example. Every subset of the real line with strictly positive Lebesgue measure contains 
a nonmeasurable set. 
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Proof. See [23], theorem 10.28 or [37], (corollary to) theorem 2.22. 

18.3.11. Proposition. Lebesgue outer measure is not countably additive. 

18.3.12. Proposition. There does not exist a positive measure on ^(M) which is translation 
invariant and such that the measure of the closed unit interval [0, 1] is 1. 

18.3.13. Proposition. If A is a Lebesgue measurable subset o/M, then the (Lebesgue) measure of 
A is the infimum of the measures of open sets containing A. 

18.3.14. Proposition. If A is a Lebesgue measurable subset o/M, then the (Lebesgue) measure of 
A is the supremum of the measures of compact sets contained in A. 

18.3.15. Exercise. Find the largest number t having the following property: 

If any four Lebesgue measurable subsets of the unit interval are given, each having 
measure at least 1/3, then two of them intersect in a set whose Lebesgue measure 
is at least t. 

18.3.16. Definition. A function / : M M is said to be Lebesgue measurable if it is *Bor(]R))- 
measurable. A similar convention holds for functions between Euclidean spaces M*" and M". 

18.3.17. Proposition. Every Borel measurable function is Lebesgue measurable, but not con- 
versely. 

18.3.18. Proposition. // / be a function from (M, to (M, *Bor(M)). Then the following are 
equivalent: 

(a) / is Lebesgue measurable. 

(b) ((-oo, a)) C R for all a G M. 

(c) /^((a,oo)) C M for all a G M. 

18.3.19. Exercise. Recall that proposition 15.3.4 asserts that the composite of two measurable 
functions is measurable. 

(a) Prove that a Lebesgue measurable function is indeed a measurable function. 

(b) Prove that the composite of two Lebesgue measurable functions need not be Lebesgue 
measurable. 

(c) So proposition 15.3.4 says that the composite of two measurable functions must be mea- 
surable, but what you just proved in part (b) says that they need not be. Similarly, in [l- ' ] 
we find at the top of page 247, "If f : X ^ Y is measurable and g: Y ^ Z is measur- 
able, then the composite go f is measurable," while in [23] in the paragraph immediately 
preceding theorem 11.7 we read, "Thus the composition of two measurable functions need 
not be measurable." What is going on? Explain very carefully. 

(d) Restate exercise 15.3.4 so that no confusion can possibly result. 

18.4. The Space Loc,{S) 

18.4.1. Definition. Let {S,fi) be a (real or positive) measure space. A function / G M.{S) is 
essentially bounded if there exists M > such that |/| < M almost everywhere. The infimum 
of ah such numbers M is the essential supremum of |/| and is denoted by ||/||oo- The family of 
all essentially bounded functions on S is denoted by CooiS). 

18.4.2. Proposition. Let {S,iJ,) be a (real or positive) measure space and M{S) be the family 
of all measurable functions f on S such that f{x) = for almost all x £ S. Define Loo(5') := 
^ooiS)/M{S). Then there exists a unique norm (also denoted by \\ ■ \\) on Loo(»S') that makes the 
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following diagram commute. 
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Loo (5) 




The norm || • || is the ESSENTIAL SUPREMUM NORM (or L-INFINITY norm) on Loo (5'). 

18.4.3. Notation. If we wish to emphasize the role played by a measure /x we may write >Coo('S', fj,) 
or Loo{S,fi) for £oo('S') or Loo (-5), respectively. 

18.4.4. Example. If 5 is a (real or positive) measure space, then Loo (5") is a unital commutative 
normed algebra. 

18.4.5. Example. If S is a (real or positive) measure space, then Loo(<S') is a Riesz space. 

18.4.6. Definition. Let (S*, /x) be a measure space and / G M{S). The essential range of /, 
denoted by essran/, is the set of all A G M such that /i(/''~(Sg(A))) > for every e > 0. 

18.4.7. Example. 

(a) The essential range of the characteristic function : M ^ M is {0}, which is a proper 

subset of its range {0, 1}. 

(b) Let f{x) = arctanx for all x G M. Then the essential range of / is [— |, |], which properly 
contains its range (— |, |). 

18.4.8. Proposition. Let (5,//) be a measure space and f G £oo('S')- The essential range of f is 
a compact subset o/M and that ||/||oo = sup{|A|: A G essran/}. 

Note: This result also holds for [/] G Loo('S') because essran / = essran 5 whenever f = g a.e. 

18.4.9. Definition. Let A be a unital algebra and a e A. The SPECTRUM of a, denoted by cr(a), 
is the set of all A G C such that a — Al is not invertible. 

18.4.10. Example. If z is an element of the algebra C of complex numbers, then a{z) = {z}. 

18.4.11. Example. The family DJls of 3 x 3 matrices of complex numbers is a unital algebra under 

5 -6 -6" 

the usual matrix operations. The spectrum of the matrix 



-1 



is {1,2}. 



18.4.12. Example. Let X be a compact Hausdorff space. If / is an element of the algebra C{X) 
of continuous complex valued functions on X, then the spectrum of / is its range. 

18.4.13. Example. Let [/] G L^{S,iJ.) where (S, /x) is a measure space. Then cr([/]) = essran/. 
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THE LEBESGUE INTEGRAL 

One matter that must be clear from the start is what the Lebesgue integral is not. It is not 
a creature dependent in some way on Lebesgue measure. It is not by definition an integral of 
a function with respect to Lebesgue measure. We will define the Lebesgue integral of functions 
with respect to arbitrary real measures, complex measures, and positive measures. Of course, one 
special case is integration with respect to Lebesgue measure. But in general Lebesgue integration 
is defined for arbitrary measures. 

19.1. Integration of Simple Functions 

19.1.1. Definition. Let 21 be an algebra of subsets of a set S. An 21-SIMPLE function on 
S is a linear combination of characteristic functions of members of 21. When the algebra 21 is 
clear from context we will write simple functions for ^-simple functions. We conclude easily from 
examples 15.3.3 and 15.3.11 that on a measurable space every simple function is measurable. A 
moment's reflection indicates that a function a: 5 — ?■ M is simple if and only if 

(i) the range of a is finite, and 

(ii) 0"*~({y}) G 21 for every y £ ranu. 

Let 5im(5') be the family of all (real valued) simple functions on S and 5im(S')'*' be the family of 
all positive members of 5im(S'). 

Suppose a is an 2l-simple function on S and that rano" = {ai,...,Q„} (where the a^s are 
distinct). Let = cj^({afc}) for 1 < k < n. It is easy to see that 

n 

fc=i 

When a simple function is written in the form (19.1) we say that it is written in standard form. 

When (5", 21) is a measurable space we say that the A^'s form a measurable partition (or a 
MEASURABLE DECOMPOSITION) of S. That is, S = \Sk=i and each Ak is a measurable set. 

19.1.2. Exercise. Define a: [0, 10] ^ M by 

^ ^ ^^[0,4] ~ ^(3,6) ^ ^[5,6] ~ '^'^[8,10] ^^Q^[7,9] ' 

Write a in standard form. 

A pleasant and crucial observation is that on a measurable space every positive measurable 
function is the pointwise limit of an increasing sequence of positive simple functions. The proof is 
quite standard. 

19.1.3. Proposition. Let f : 5* — )• [0,oo] be a positive measurable function on a measurable space S . 
Then there exists a sequence (an) of simple functions on S such that 

< cr„ t / (ptws) . 

Proof. For each n € N let 

An,k = ( It) ) whenever I < k < n2'' 
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and 

B„ = /^([n,oo)) . 

Then define 

fc=i 

It is not difficult to see that the sequence ((t„) satisfies < (7„ t /• (Notice, in particular, that if x 
is fixed, then an{x) > f{x) — 2~" for all n > f{x).) □ 

19.1.4. Exercise. Let /: (0, 3] — )• M: x i— t- ^. Make a careful sketch of the corresponding simple 
functions (72 and as which appear in the proof of proposition 19.1.3. 

19.1.5. Example. If 21 is an algebra of subsets of a set S, then 5im(S') is a Riesz space. Notice 
that, in particular, if (5,21) is a measurable space, then 5im(S') is a Riesz subspace of Ai{S) (see 
example 15.3.11). 

19.1.6. Example. If 21 is an algebra of subsets of a set S, then 5im(S') is an algebra. Moreover, 
it is a normed algebra (under the uniform norm). If (5,21) is a measurable space, then 5im(5) is 
a subalgebra of 7W(5) (see example 15.3.12). 

Now, supposing that our algebra of subsets is equipped with a finitely additive set function /i. 
Then we can define the integral of a simple function with respect to fi. 

19.1.7. Definition. Let ^ be a real finitely additive set function on an algebra 21 of subsets of 
a set S and cr = Yl ^kXj^ be an 2l-simple function on S written in standard form. Then the 
Lebesgue integral of cr (with respect to /i) is defined by 

/ adfi:=Y,akKAk). (19.2) 

•J s 

When no confusion seems likely we may simplify the notation to J a dfi or even J a. In these notes 
we are interested primarily in the Lebesgue integral. So whenever the unmodified term integral 
appears we take it to mean Lebesgue integral (unless, of course, context indicates otherwise). 

The next two exercises are almost obvious. The first says that in order for equation (19.2) to 
hold it is not important that a be written in standard form. Any decomposition of its domain into 
members of the algebra 21 such that a is constant on each member of the decomposition will do. 

19.1.8. Proposition. Let 21 be an algebra of subsets of a set S, fi be a real finitely additive set 
function on 21, a be an ^-simple function on S, and {Ai, . . . , An} be a decomposition of S into 
members of^ such that a is constant on each Ai^.. Then j cr = ^akfJ-iAk), where is the value 
of (T on A}^ (1 < k < n ). 

19.1.9. Proposition. Let a and r be simple functions on the same algebra 21 of subsets of a set 
S and n be a real finitely additive set function on 21. // ^{{x G S : cf{x) ^ t(x)}) = 0, then 
J adfi = frdfi. 

19.1.10. Exercise. Let a be the simple function defined in exercise 19.1.2 and A be Lebesgue 
measure. Compute / a dX. 

19.1.11. Proposition. Let 2t be an algebra of subsets of a set S, fj, be a real finitely additive set 
function on 21, and a be an ^-simple function on S. Then 

a djJL = I dfi — / dfi . 
s Js Js 
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19.1.12. Proposition. Let ^ he a real measure on a measurable space S and a he a simple function 
on S. Then 

a djj = / a dfi^ — / cr d/i^ . 
Js Js 

19.1.13. Example. Let 2t be an algebra of subsets of a set S and /i be a real finitely additive set 
function on 21. Then the map 

Jl : 5im(5) — M : cr i— / a d^ 

Js 

is a linear functional on the vector space 5im(S'); that is, Jl G 510(5")^. 



19.1.14. Example. If ^ is a positive finitely additive set function on an algebra 2t of subsets of a 
set S, then the function 



IJ, : 5im(S') — M : tr i— )■ / a dfi 

Js 



is a positive (and therefore order bounded — see proposition 14.3.5) linear functional on the Riesz 
space Sim{S). That is, Ji G 5im(5)~. 

19.1.15. Example. If /i is a bounded real finitely additive set function on an algebra 21 of subsets 
of a set S, then the function 

Jl : 5im(S') — M : cr i— / a dfi 

Js 

is a bounded linear functional on the normed linear space 510(5"). That is, Jl G Sini{S)* . 

19.1.16. Proposition. Let 21 be an algebra of subsets of a set S. Then the map 

^: ba5^5im(S)*: /x ^ 
(where Jl is defined in example 19.1.15) is an isometric isomorphism. 

19.1.17. Definition. Let be a real finitely additive set function on an algebra 21 of subsets of a 
set S, a be an 2l-simple function on S, and G 21. Then the Lebesgue integral of a over E 
(with respect to h) is defined by 

/ a dfi := / Xp'^dfi. (19.3) 
Je Js 

19.1.18. Exercise. Alter definition 19.1.7 to define the Lebesgue integral of a positive simple 
function cr on a measurable space {S, 21) with respect to a positive measure ^. (Explain why it is a 
good idea to restrict our attention to positive simple functions.) Verify that if a and r are positive 
simple functions and a > 0, then J {a + t) dfi = J a dfj, + J t dfi and / aa dfj, = a j a dfi. 

19.2. Integration of Positive Functions 

19.2.1. Definition. Let /x be a positive measure on a measurable space S and /: S" — )• [0, (X)] be 
a positive measurable function on S. Then the Lebesgue integral of / (with respect to n) is 
defined by 

J f dfj, = sup I y a dfi: a G 5im(5) and < cr < / 1 . 
As with simple functions if £^ is a measurable subset of S we define 

fdfi= Xpfdfi. 
Js 

19.2.2. Proposition. Let {S,fi) be a positive measure space and f,g:S^ [0,oo] be positive 
measurable functions on S. If f < g, then f dfi < gdfi. 
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19.2.3. Theorem (Monotone Convergence Theorem). Let {S,n) be a positive measure space and 
(fn) be a sequence of measurable functions from S into [0, oo]. If ^ < fn \ 9 (ptws), then 



gdii= hm / fnd^i. 

Proof. See [10] theorem 18.7; [11], proposition 2.3.3; [ ], theorem 4.6; or [37], theorem 1.26; 

19.2.4. Proposition. Let f , g: S ^ [0;Oo] be positive measurable functions on a positive measure 
space {S,fi). Then Jg{f + g)dfi = jgfd^ + J^gdfJ,. 

19.2.5. Proposition. Let / : 5 — [0,oo] be a positive measurable function on a positive measure 
space {S, fi) and a > 0. Then af dfj, = a f^fd/j,. 

19.2.6. Proposition. Let /: 5 — t- [0,oo] be a positive measurable function on a positive measure 
space (5, fi). Then f = a.e. if and only if fgfdfi = 0. 

19.2.7. Proposition. Let /: S — )• [0, oo] be a positive measurable function on a positive measure 
space {S, n) and A and B be measurable subsets of S with A Q B. Then Jj^fdfi< f dfi. 

19.2.8. Proposition. For each n £ N let fn - S ^ [0,oo] be a positive measurable function on a 
positive measure space {S, /i) . Then 



^ / oo \ oo ^ 

/ (^/n)d/i = 5^ fndfl. 



19.2.9. Corollary. Suppose that ajk > for every j, k gN. Then 



oo oo 



j=l k=l k=l j=l 

19.2.10. Proposition (Fatou's lemma). For each n £N let fn - 5* — t- [0, oo] be a positive measurable 
function on a positive measure space {S,fx). Then 



(hm inf /„) dfi < hm inf / /„ dfj,. 

Js 



19.2.11. Example. Give an example to show that the inequahty in the preceding proposition may 
be strict. 

19.2.12. Proposition. Let / : S* — [0, oo] be a positive measurable function on a positive measure 

m 

space (5*, /Li). Define for every ACS 



u{A)= I fdfi. (19.4) 

Then v is a positive measure on S and 

'' gdu= I gfd^i (19.5) 



for every measurable g >0 on S. 

The equation (19.5) is often written in abbreviated form 

du = f dn 

or even more fancifuhy 

— ~ / 
dji 

For this reason / is often referred to as the Radon-Nikodym derivative of u with respect to ii. 
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19.2.13. Definition. Let be a real measure on a measurable space S and /: S" — )• [0,oo] be a 
positive measurable function on S. Then we define 



fdfi= / / (i;u+ - fdn 
Js Js 

whenever not both f d/j,'^ and Jg f dfj,~ are infinite. 

19.3. Integration of Real and Complex Valued Functions 

19.3.1. Definition. Let (5*, /u) be a positive measure space. A measurable complex (or extended 
real) valued function is (Lebesgue) integrable if fg\f\ dfi < oo. We denote the family of all 
complex valued Lebesgue integrable functions on S by C) or Ci{S,C) and the family of all 

extended real valued Lebesgue integrable functions on S by Ci{fi) or Ci{S). If / G we define 

[ fdfi= [ f+dfi - [ f-df, 
Js Js Js 

and if / G Ci{n, C) we define 

f dfi = / udjjL + i / V dpL 



where u{x) is the real part of f{x) and v{x) is the imaginary part of f{x) for each x £ S. 

Alternative notations for the Lebesgue integral are f{x) dfj,{x) and / f dfi. The notations 
f{x) d\{x) and Jg f dX denote integration with respect to Lebesgue measure. Because of propo- 
sition 19.3.18 the integral with respect to Lebesgue measure is frequently written as Jg f{x) dx. 
When / is a function defined on an interval [a, b] it is common practice to write jj* / dfj, for J^^ ^ f dfj,. 
A similar convention holds for other intervals (a, 6], [a,oo), etc. 

19.3.2. Proposition. If (S", /x) is a positive measure space, then £i(/i) is a Riesz space and the 
map 

p:£l(/x)^M: / fdn 

Js 

is a positive linear functional on Ci (fi) . 

19.3.3. Corollary. /// is a real valued integrable function on a positive measure space {S,fi), then 



fdfi 



Js 



Proof. The result is follows immediately from propositions 19.3.2 and 14.3.13(b). □ 

19.3.4. Proposition. // / is a complex valued integrable function on a positive measure space 
{S,ii), then 



fdfi 



< [ \f\dl^. 
Js 



19.3.5. Proposition. If{S, /x) is a positive measure space then the map f i— )• Jg\f\ dfi is a seminorm 
on the vector space £i(/x). 

19.3.6. Proposition. Let (5*, /x) be a positive measure space and f , h €z Ci{p,). If g is a measurable 
function on S such that f < g < h a.e., then g G Ci{fi). 

19.3.7. Theorem (Lebesgue Dominated Convergence Theorem). Let (5*, /u) be a positive measure 
space and (fn) be a sequence on measurable complex valued functions on S. If there exists an 
integrable function g such that < g for every n G N and h{x) = lim„/„(x) exists for every 
X £ S, then h G £i(^, C) and 

hdfi = lim / /„ dfi. 
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Hint for proof. Apply Fatou's lemma to the sequences {g + fn) and {g — fn)- 

19.3.8. Proposition. /// and g are measurable functions on a positive measure space {S,fi) such 
that J\f — g\dfj, = 0, then f = g a.e. 

19.3.9. Example. Let [x be counting measure on N and let = (— l)"/n for each n G N. Then 
the infinite series ^ a„ converges (by the alternating series test) but, as a function on N, is not 
integrable with respect to fi. 

19.3.10. Example. The function x i— t- — is not integrable (with respect to Lebesgue measure) on 

X 

the interval (0, 1]. 

19.3.11. Example. The function x i— )• —= is integrable (with respect to Lebesgue measure) on 

^/x 

the interval (0, 1]. 

Hint for proof. For each n G N let 

n-l 

T„ = (n + 1)Y, ,+y^(A; + l)Y, 

(o,"-^] ^ ((fc+l)-^fc-2] 

19.3.12. Proposition. Let f be an integrable function on a positive measure space (5", /x) and (En) 
be a sequence of pairwise disjoint measurable subsets of S whose union is F. Then 



Jf Je, 



19.3.13. Proposition. If{S,fj,) is a positive measure space and f is an integrable function on S, 
then J^fdfi = for every measurable subset A of S if and only if f = a.e. on S. 

19.3.14. Proposition. Let (/S, /u) be a positive measure space andN{S) be the family of all mea- 
surable functions f on S such that f{x) = for almost all x £ S. Define Li(5) := Ci{S)/M{S). 
An alternative notation for Li(S') is Li(//). Then there exists a unique norm on Li(5) that makes 
the following diagram commute. 

£i(5) 



TT 




19.3.15. Proposition. On a positive measure space (5*, /u) let Li(S') be defined as in proposi- 
tion 19.3.14- Then Li(S') is a Riesz space. 

19.3.16. Proposition. If S is a positive measure space, then the set of all integrable simple func- 
tions is dense in the normed linear space Li(5). 

19.3.17. Proposition. Suppose that {fn) is a sequence of measurable functions defined on a mea- 
surable subset E of a positive measure space {S,^) such that X^^Lil/fcl ^■s 0,^ integrable function 
on E. Then the function YlV=i fk integrable and 



/oo oo ^ 

k=l k=l ''^ 
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19.3.18. Proposition. Let f he a hounded real valued function on the interval [a, 6] in M. If f 

is Riemann integrahle, then it is (Lehesgue measurable and) Lehesgue integrable and its Lebesgue 
integral is the same as its Riemann integral; that is, 

fd\= f f{x)dx. 

Proof. See [2], theorem 19.6; [l], theorem 1.7.1(b); [a. theorem on page 99; [-£], theorem 
2.28(a); or [32], theorem 3.23. 

sin X 

19.3.19. Example. Let f(x) = for x > and /(O) = 1. Then / has a finite improper 

X 

Riemann integral but (while it is Lebesgue measurable) it is not Lebesgue integrable. 



be found in [2], theorem 20.7, case II, where it is shown that dx 



Proof. The somewhat complicated business of computing the integral of the function / can 

^oo sinx TT 

dx = —. 

X 2 

19.3.20. Proposition. Let f he a hounded real valued function on the interval [a,b] in M. Then 
f is Riemann integrahle if and only if it is continuous almost everywhere (with respect to Lehesgue 
measure). 

Proof. See [2], theorem 19.7; [ '.], theorem 1.7.1(a); [" theorem on page 99; or [".8], theorem 
2.28(b). 

19.3.21. Exercise. Regard S = (0, 3] as a positive real measure space under Lebesgue measure A. 
Define functions / and g on S hy 

f{x) = sin fx and ^(x) = ^ - X(i 2] + '^^{2,3] ' 
Define real measures fi and v by 

i^{A)= [ fdX and ^^{A)= [ g dX 
J A J A 

m 

for every A C. S. 

(a) Find a Hahn decomposition of (5, /x). 

(b) Find the Jordan decomposition of ^. (Express your answer in terms of Lebesgue measure.) 

(c) Find \^l\{A) for ACS. 

(d) Find (/iVz/)((0,2]). 
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COMPLETE METRIC SPACES 



20.1. Cauchy Sequences 

20.1.1. Definition. A sequence (xn) in a metric space is a Cauchy sequence if for every e > 
there exists no G N such that d{x„i, x„) < e whenever m, n > uq. This condition is often abbreviated 
as follows: d{xm, Xn) — as m, n — )• oo (or limm„ra-s>oo d{xm, Xn) = 0). 

20.1.2. Proposition. In a metric space every convergent sequence is Cauchy. 

20.1.3. Proposition. Every Cauchy sequence which has a convergent subsequence is itself conver- 
gent (and to the same limit as the subsequence). 

20.1.4. Proposition. In a metric space every Cauchy sequence is bounded. 

20.1.5. Definition. A metric space M is complete if every Cauchy sequence in M converges. 
Completeness is not a topological property of a metric space, as the next example shows. 

20.1.6. Example. Consider the interval J = (0, 1] under the usual metric {d{x,y) = |x — y\) and 



under the metric d' defined by d'{x,y) 



1 _ 1 

X y 

topologies; but one is complete and the other is not. 



. This produces two metric spaces with identical 



20.1.7. Definition. The diameter of a subset ^ of a metric space is defined by 

diam^l := sup{(i(a;, y) : x,y & A} 
if this supremum exists. Otherwise diam^ := oo. 

20.1.8. Proposition. If A is a subset of a metric space, then diamA = diam74. 

20.1.9. Proposition. In a metric space M the following are equivalent: 

(a) M is complete. 

(b) Every decreasing sequence of nonempty closed sets in M whose diameters approach has 
nonempty intersection. 

Hint for proof . To show that (a) implies (b), let (Fk) be a decreasing sequence of nonempty 
closed subsets of M. For each k choose Xk G F^. Show that the sequence (xk) is Cauchy. To show 
that (b) implies (a), let [xk] be a Cauchy sequence in M. Define An = {xj-: k > n} and F„ = An. 
Show that {Fn) is a decreasing sequence of closed sets whose diameters approach 0. Choose a point 
a in nF„. Find a subsequence {xn^.) of such that d{a,Xn^.) < 2~*^. 

20.1.10. Example. Since R is complete, the preceding proposition tells us that every nested 
sequence of nonempty closed subsets of M whose diameters approach has nonempty intersection. 

(a) This statement is no longer correct if the words "whose diameters approach 0" are deleted. 

(b) The statement is no longer correct if the word "closed" is deleted. 

20.1.11. Proposition. Every complete subspace of a metric space is closed and every closed subset 
of a complete metric space is complete. 

20.1.12. Proposition. Every compact metric space is complete. 

20.1.13. Proposition. The product of two complete metric spaces is complete. 
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20.1.14. Proposition. If d and p are strongly equivalent metrics on a set M, then {M,d) is 
complete if and only if {M, p) is. 

20.1.15. Example. The conclusion of the preceding proposition may not hold if the two metrics 
are merely equivalent. 

20.1.16. Definition. A metric space {M,d) is totally bounded if for every e > there exists 
a finite subset F of M such that for every a € M there is a point x £ F such that d{x, a) < e. 
The finite set F is frequently called an e-NET for M. This definition has a more or less standard 
paraphrase: A space is totally bounded if it can be kept under surveillance by a finite number of 
arbitrarily near-sighted policemen. 

The following equivalent formulations of total boundedness appear in [26]. 

20.1.17. Proposition. For a metric space {M,d) the following are equivalent: 

(a) Every sequence in M has a Cauchy subsequence. 

(b) The space M is totally bounded. 

(c) For every e > the space M can be written as a union of a finite collection of sets each 
with diameter strictly less than e. 

(d) Suppose e > 0. Then every infinite subset of M contains an infinite set of diameter strictly 
less than e. 

Hint for proof . To prove (a) implies (b), let e > and 5 be the family of all subsets F of M 
with the property that d{x,y) > e for every pair of distinct points x and y in F. Partially order ^ 
by inclusion. Use Zorn's lemma. 

To prove (d) implies (a) let (x^) be a sequence in M. Without loss of generality suppose 
Xj / whenever j / k. Choose a subsequence (xi fc) of (x^) such that the distance between 
any two elements in its range is strictly less than 1. Next choose a subsequence (x2,fc) of (xi^k) 
such that the distance between any two elements in its range is strictly less than ^. Next choose a 
subsequence {x^^k) of {x2,k) such that the distance between any two elements in its range is strictly 
less than |. Continue inductively. What can you say about the subsequence (x^^fc)? 

20.1.18. Proposition. A metric space is compact if and only if it is complete and totally bounded. 

We extend the notion of an infinite series from the real numbers to an arbitrary normed linear 
space. The language is exactly the same. 

20.1.19. Definition. Let be a sequence in a normed linear space V. The sum s„ = Yl^=i 

of the first n terms of the sequence is the n^'^-PARTiAL SUM of the sequence. The sequence of 
partial sums is an infinite series and is denoted by XlfcLi ^k- The series is said to converge if 
the sequence (s„) of partial sums converges and, if the series converges, its sum is the limit of the 
sequence (s„) of partial sums. The notation for the sum of the series is, confusingly, the same as 
for the series itself X^^Li^fc- Notice that saying that the series 'YlT=i^k converges is exactly the 
same as saying that the sequence (x^) is summable (see definition 12.1.29). 

Similarly, the assertion that the series X^^Li ^k converges absolutely is the same as claiming that 
the sequence {xn) is absolutely summable. That is, the series X^^Li Xk is absolutely convergent 
if Efclilkfell < oo- 

20.1.20. Proposition. A normed linear space V is complete if and only if every absolutely sum- 
mable sequence in V is summable. 

20.1.21. Proposition. If M is a closed subspace of a complete normed linear space V , then V/M 
is complete. 

20.1.22. Example. The normed algebra B{S) (see example 12.1.9) is complete. 

20.1.23. Example. If X is a topological space then the normed algebra Cb(X) (see example 12.3.17) 
is complete. A special case of this occurs when X is a compact topological space. Then C{X) is a 
complete normed algebra. 
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20.2. Completions and Universal Morphisms 

20.2.1. Definition. Let M and be metric spaces. We say that iV is a COMPLETION of M if A?" 
is complete and M is isometric to a dense subset of A'^. 

20.2.2. Example. If in the definition 20.2.1 of completion we were to substitute the word "home- 
omorphic" for "isometric" then the uniqueness result 20.2.11 would no longer be true. In fact, it 
would then be possible to have two "completions" of a metric space that are not even homeomor- 
phic. 

In the construction of the real numbers from the rationals one approach is to treat the set of 
rationals as a metric space and define the set of reals to be its completion. One standard way of 
producing the completion involves equivalence classes of Cauchy sequences of rational numbers. 
This technique extends to metric spaces: an elementary way of completing an arbitrary metric 
space starts with defining an equivalence relation on the set of Cauchy sequences of elements of the 
space. Here is a slightly less elementary but considerably simpler approach. 

20.2.3. Proposition. Every metric space M is isometric to subspace of B{M). 

Hint for proof . If {M,d) is a metric space fix a G M. For each x £ M define (px - M ^ M 
by (pxiu) = d{x,u) — d(u,a). Show first that (px £ B{M) for every x G M. Then show that 
(f): M ^ B{M) is an isometry. 

20.2.4. Corollary. Every metric space has a completion. 

Much of mathematics involves the construction of new objects from old ones — things such as 
products, coproducts, quotients, completions, compactifications, and unitizations. More often than 
not it is possible to characterize such a construction by the existence of a unique morphism having 
some particular property. Because this morphism and its corresponding property characterize the 
construction in question, they are referred to as a universal morphism and a universal property, 
respectively. Here is one very common way in which such morphisms arise. 

20.2.5. Definition. Let A — be a functor between categories A and B and B be an object 
in B. A pair {A, u) with A an object in A and u a B-morphism from B to f{A) is a universal 
MORPHISM for B (with respect to the functor F) if for every object A' in A and every B-morphism 

/ / 
B s-F(A') there exists a unique A-morphism A ^A' such that the following diagram commutes. 

B — ^ F(^) A 




F{A') A' (20.1) 

In this context the object A is often referred to as a UNIVERSAL object in A. 

20.2.6. Definition. As mentioned earlier the categories of interest in these notes are concrete 
categories (see 8.1.6). If A is an object in some concrete category C, we denote by OA its underlying 

/ 

set. And if A ^ B is a morphism in C we denote by □/ the map from OA to OB regarded 

simply as a function between sets. It is easy to see that □ , which takes objects in C to objects in 
Set (the category of sets and maps) and morphisms in C to morphisms in Set, is a functor. It is 
referred to as a forgetful functor. In the category Vec of vector spaces and linear maps, for 
example, □ causes a vector space V to "forget" about its addition and scalar multiplication (OV 
is just a set). And if T: F — )• is a linear transformation, then OT: OV — )■ OW is just a map 
between sets — it has "forgotten" about preserving the operations. 

Occasionally it is useful to allow a forgetful functor to forget some of the structure of a category 
but not all. For example, □ might take the category ALG to the category VEC by forgetting about 
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just multiplication or, perhaps, it might take the category MET of metric spaces and continuous 
maps to the category TOP by forgetting about the metrics but not the topologies they induce. 

The next proposition shows that the completion of a metric space is universal. In this result 
the categories of interest are the category of metric spaces and uniformly continuous maps and 
its subcategory consisting of complete metric spaces and uniformly continuous maps. Here the 
forgetful functor □ just forgets about completeness. 

20.2.7. Proposition. Let M he a metric space and M he its completion. If N is a complete metric 
space and f : M ^ N is uniformly continuous, then there exists a unique uniformly continuous 
function f : M ^ N which makes the following diagram commute. 

M M 




Y 



UN N 

20.2.8. Example. Here is the usual presentation of the coproduct of two objects in a category. 
Let Ai and A2 be two objects in a category C. A COPRODUCT of Ai and A2 is a triple ((3,^1,^2) 

with Q an object in C and A^ — *^ Q {k = 1,2) morphisms in C which satisfies the following 

fk 

condition: if B is an arbitrary object in C and Ak ^B {k = 1, 2) are arbitrary morphisms in C, 

/ 

then there exists a unique C-morphism Q ^ B which makes the following diagram commute. 



B 




Ai^^Q^^^A2 (20.2) 

It may not be obvious at first glance that this construction is universal in the sense of defi- 
nition 20.2.5. To see that it in fact is, let D be the diagonal functor from a category C to the 
category of pairs (see example 15.4.7). Suppose that {Q, li, L2) is a coproduct of the C-objects 

Ai and A2 in the sense defined above. Then A = (^41,742) is an object in C^, A^D{Q) IS a 
C^-morphism, and the pair [Q, l) is universal in the sense of 20.2.5. The diagram corresponding 
to diagram (20.1) is 



A^^D{Q) Q 




D{B) B (20.3) 

fk 

where B is an arbitrary object in C and (for k = 1,2) A^ ^B are arbitrary C-morphisms so 

that / = (/i, /2) is a C^-morphism. 

20.2.9. Example. Let 5 be a set and VEC — ^^SET be the forgetful functor from the category 
VEC to the category SET. If there exists a vector space V and an injection S — UV which 
constitute a universal morphism for S (with respect to □), then V is the free vector space 
over S. Of course merely defining an object does not guarantee its existence. In fact, free vector 
spaces exist over arbitrary sets. Given the set S let V be the set of all real valued functions 
on S which have finite support. Define addition and scalar multiplication pointwise. The map 
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t: s I— )• X|^| of each element s G 5 to the characteristic function of {s} is the desired injection. To 
verify that V is free over S it must be shown that for every vector space W and every function 

/ 7 

S n\W there exists a unique hnear map V which makes the fohowing diagram commute. 

S '-^DV V 




1 



nw w 

20.2.10. Proposition. Universal objects in a category are essentially unique. 

The following consequence of propositions 20.2.7 and 20.2.10 allows us to speak of the comple- 
tion of a metric space. 

20.2.11. Corollary. Metric space completions are unique (up to isometry). 

20.3. Compact Subsets of C{X) 

20.3.1. Definition. Let X be a topological space, (A^, p) be a metric space, And J-" be a nonempty 
subset of C{X,N). The family F is equicontinuous at a point a £ X li for every e > there 
exists a neighborhood U oi a such that p[f{x), /(a)) < e for every f ^ T and every x G C/. The 
family is equicontinuous on a set A C X if it is equicontinuous at every point in A. 

If {M,d) and (iV, are metric spaces and is a nonempty subset of C{M,N) we say that 
the family J" is uniformly equicontinuous on M if for every e > there exists S > such 
that p[f{x),f{y)) < e whenever / E -F and d{x,y) < 6. (Where we use the expression uniformly 
equicontinuous many authors use just equicontinuous. We show below in proposition 20.3.4 that, 
at least on compact metric spaces, the two notions coincide.) 

Careful and readable discussions of equicontinuity can be found in [10], pages 178-183; [20], 
pages 365-374; and [ ], pages 164-169. You can find some instructive and interesting exercises 
on equicontinuity in [ ], (4.5.7) and (4.5.9); [iO], pages 179-180 and 182-183; ['>n], pages 372-374, 
and [ ], pages 168-169. 

20.3.2. Example. For each n G N let /„: [0,1] ^ M be defined by /„(x) = x". The family 
{fn- n &N} is not equicontinuous at x = 1. 

20.3.3. Example. The family {/„ : n G N} of functions defined in the preceding exercise is equicon- 
tinuous at X = 

20.3.4. Proposition. A nonempty family T of continuous functions from a compact metric space 
(M, d) into a metric space (N, p) is equicontinuous on M if and only if it is uniformly equicontinuous 
on M. 

20.3.5. Definition. Let S" be a nonempty set and M be a metric space. A family of functions 
/: 5 — ^ M is pointwise bounded if {/(x) : / G J"} is a bounded subset of M for each x £ S. It 
is common to say that the family is uniformly bounded if it is a bounded subset of the metric 
space B{S,M). 

Two things should be clear: 

(1) uniform boundedness implies pointwise boundedness; and 

(2) the word "uniformly" in uniformly bounded is, strictly speaking, redundant — it is there 
for emphasis. 

20.3.6. Proposition. LetT be an equicontinuous family of functions inC{X) where X is a compact 
topological space. If J- is pointwise bounded, then it is (uniformly) bounded. 
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20.3.7. Proposition. A family T of functions in C{X), where X is a compact topological space, 
is totally bounded if and only if it is equicontinuous and pointwise bounded. 

Proof. See [ ' proposition 5.1.10. 

20.3.8. Theorem (Ascoli-Arzela theorem). Let X be a compact topological space. A family T C 
C(X) is compact if and only if it is closed, pointwise bounded, and equicontinuous. 

The preceding theorem also goes under the names Ascoli's theorem and the Arzeld-Ascoli the- 
orem. 

A subset j4 of a topological space X is said to be relatively compact if its closure is compact. 
Thus the Ascoli-Arzela theorem may be given a slightly shorter form: For a compact space X a 
subset T C C(X) is relatively compact if and only if it is bounded and equicontinuous. 



20.4.1. Definition. A Banach space is a complete normed linear space. We denote the category 
of Banach spaces and bounded linear maps by BAN. A Banach algebra is a complete normed 
algebra. 

20.4.2. Example. Under its usual operations M is a unital commutative Banach algebra. 

20.4.3. Example. Let 5 be a nonempty set. Under the usual pointwise operations B{S) is a unital 
commutative Banach algebra. (See example 20.1.22.) 

20.4.4. Example. Let X be a nonempty compact Hausdorff space. Under the usual pointwise 
operations C{X) is a unital commutative Banach algebra. (See example 20.1.23.) By the extreme 
value theorem 17.1.18 it is a unital (Banach) subalgebra of B{X). 

20.4.5. Example. Let X be a nonempty topological space. Under the usual pointwise operations 
Ci,{X) is a unital commutative Banach algebra. (See example 20.1.23.) 

20.4.6. Example. Let X be a nonempty locally compact Hausdorff space. A complex valued 
function / on X is said to VANISH AT infinity if for every e > there exists a compact set outside 
of which |/(x)| < e. The family of all continuous complex valued functions on X which vanish at 
infinity is denoted by Cq{X). Under the usual pointwise operations and the uniform norm Cq{X) 
is a (in general nonunital) commutative Banach algebra. 

20.4.7. Example. Let V and be a normed linear spaces. Then 5S(y, VF) is a Banach space if 
and only if W is complete. In particular the dual V* of any normed linear space U is a Banach 
space. Furthermore, if V is complete, then the family 5S(y) of operators on \^ is a unital (in general 
noncommutative) Banach algebra. 

20.4.8. Example. The set M„ of n x n matrices of real numbers is a unital (in general noncom- 
mutative) Banach algebra. (See example 4.6.10.) 

20.4.9. Example. If (5, /i) is a positive measure space, then (under pointwise multiplication of 
functions) Loo(5') is a unital commutative Banach algebra. 

20.4.10. Example. If (5, /i) is a positive measure space, then Li(S') is a Banach space. (See 
proposition 19.3.14.) 

20.4.11. Definition. Suppose that (5, /i) is a positive measure space and 1 < p < oo. Let Cp{S) 
be the set of all measurable functions f on S such that Jg\f\^dfj, < oo. The mapping 



is a seminorm on Cp{S). Identifying functions that differ only on a set of measure zero produces a 
normed linear space Lp(5) (or Lp(/x)) in exactly the same way as the space Li(S') was created in 
proposition 19.3.14. The resulting norm, also denoted by || • is the LP NORM on Lp(5). 



20.4. Banach Spaces; Lp-spaces 
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In these notes when 5 is a subset of M and no measure is specified, Lebesgue measure is 
understood; that is, Lp(S') is taken to mean Lp(S', A). 

20.4.12. Definition. If 1 < p, g < oo we say that p and q are CONJUGATE exponents if - + - = 1. 

p q 

In particular, we take 1 and oo to be conjugate exponents. 

20.4.13. Proposition (Holder's inequality). Let {S,fi) be a positive measure space and p and q be 
conjugate exponents where 1 < p < oo. If f £ Lp(S') and g G Lq{S), then fg G Li(S') and 

\\f9\\i<\\f\\p\\9\U. 

Proof. See [2], theorem 25.3; [4], 2.4.5; [18] (6.2); [32], theorem 9.9; or [37], theorems 3.5 
and 3.8. 

20.4.14. Example. The space L2([0,l]) is contained in Li([0,l]). 

The preceding example easily generalizes to all spaces of finite measure but, as the next example 
shows, not to spaces of infinite measure. 

20.4.15. Example. The function x ^-^ ^ belongs to L2([l,oo)) but not to Li([l,oo)). 

20.4.16. Proposition (Minkowski's inequality). Let {S,ij,) be a positive measure space and 1 < 
p < oo. If f £ Lp{S),then 

\\f + 9\\p<\\f\\p + \\g\\p. 

Proof. See [2], theorem 25.4; [4], 2.4.7; [18] (6.5); [32], theorem 9.10; or [37], theorems 3.5 
and 3.9. 

20.4.17. Example. Let {S,n) be a positive measure space and 1 < p < oo. Then Lp(S') is a 
Banach space. 

The next result generalizes proposition 19.3.16. 

20.4.18. Proposition. Let {S,fx) be a positive measure space and 1 < p < oo. Then the set of all 
integrable simple functions is dense in Lp{S). 

The notation || • ||p for the norm on the space Lp(S') seems natural enough. But some people 
find the notation || • ||oo for the essential supremum norm a trifle odd. Perhaps the next result will 
make it seem more plausible. 

20.4.19. Proposition. Let E be a Lebesgue measurable subset o/M" with nonzero finite Lebesgue 
measure and let f be a real (or extended real) valued function on E. Then asp runs over the natural 
numbers we have 

lim||/||p=||/|U. 

Hint for proof . Show that 

(1) limsupp^oo||/||p < ll/lloo and 

(2) liminfp_^oo||/||p > M for every M such that < M < ||/||oo. 

Two other important normed linear spaces, which we have already encountered, are complete: 
ba(S') and ca(S'). (See example 16.5.19.) 

20.4.20. Example. If 21 is an algebra of subsets of a set S", then under the norm defined in 
example 16.5.19 the space ba(5) is a Banach space. If 21 is a cj-algebra of subsets of a set 5, then 
ca(S') is also a Banach space. 

Proof. See [_.], theorems 9.52 and 9.55. 

It is also pleasant to discover that on a normed linear space, switching from the given norm to 
an equivalent one does not affect completeness of the space. 

20.4.21. Proposition. Suppose that \\ ■ \\ and \\\ ■ \\\ are equivalent norms on a vector space V . If 
V is a Banach space under \\ ■ \\, then it is also a Banach space under \\\ ■ \\\. 
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20.5. Banach Algebras 

20.5.1. Definition. A map f : A ^ B between Banach algebras is a (Banach algebra) homo- 
MORPHISM if it is both an algebraic homomorphism and a continuous map between A and B. We 
denote by BALG the category of Banach algebras and continuous algebra homomorphisms. 

20.5.2. Proposition. Let A and B be Banach algebras. The direct sum of A and B, denoted 
by A(B B, is defined to be the set A x B equipped with pointwise operations: 

(i) (a, 6) + (a', 6') = {a + a',b + b'); 

(ii) (a, 6) (a', 6') = {aa',bb'); 

(iii) a{a,b) = {aa,ab) 

for a,a' £ A, b, b' G B, and a G C. As in 12.3.2 define ||(a, 6)|| = ||a|| + ||6||. This makes A® B 
into a Banach algebra. 

20.5.3. Corollary. If A and B are Banach spaces, then so is their direct sum A(BB (with addition, 
scalar multiplication, and norm defined as in the preceding proposition). 

20.5.4. Exercise. Identify products and coproducts (if they exist) in the category BALG of 
Banach algebras and continuous algebra homomorphisms and in the category of Banach algebras 
and contractive algebra homomorphisms. (Here contractive means ||Tx|| < ||x|| for all x.) 

20.5.5. Proposition. In a Banach algebra the operations of addition and multiplication (regarded 
as maps from A(BA to A) are continuous and scalar multiplication (regarded as a map from M.(BA 
to A) is continuous. 

20.5.6. Proposition. Let a be an element of a unital Banach algebra A. If \\a\\ < 1, then 1 — a G 
invvl and (1 — a)^^ = YlT=o'^^- 

Hint for proof . Show that the sequence (1, a, a} , . . . ) is summable. (In a unital algebra we take 
to mean 1.) 

o 

20.5.7. Proposition. If A is a unital Banach algebra, then inv A C A. 

Hint for proof . Let a G invvl. Show, for sufficiently small h, that 1 — a~^h is invertible. 

20.5.8. Proposition. If a belongs to a unital Banach algebra and \\a\\ < 1, then 

11(1 -a)-i- 111 < 



1 



20.5.9. Proposition. Let A be a unital Banach algebra. The map a i— )• a ^ from inv A into itself 
is continuous. 

20.5.10. Proposition. Let A be a unital Banach algebra. The map r: a ^ from inv A into 
itself is differentiable and at each invertible element a, we have dra{h) = —a^^ha^^ for all h £ A. 

20.5.11. Notation. Let / be a complex valued function on some set S. Denote by Zf the set of 
all points x in 5 such that f{x) = 0. This is the ZERO set of /. 

20.5.12. Example. The invertible elements in the Banach algebra C{X) of all continuous real or 
complex valued functions on a compact Hausdorff space X are the functions which vanish nowhere. 
That is, 

invC(X) = {/ G C{X) : Zf = %}. 
This is not true in general of either C[,{X) or Cq{X) when X fails to be compact. 
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20.6. Hilbert spaces 

We now make an abrupt change of convention. Heretofore we have taken vector spaces (with or 
without extra structure) to have real scalars unless the contrary is stated. In the study of Hilbert 
spaces however it is convenient to let the default case be complex scalars. 

20.6.1. Definition. Let y be a complex vector space. A function which associates to each pair of 
vectors x and y in V a complex number {x, y) is an INNER PRODUCT (or a DOT product) on V 
provided that the following four conditions are satisfied: 

(a) If X, y, z G V, then 

{x + y,z) = {x,z) + {y,z). 

(b) If X, 2/ € V, then 

{ax,y) = a{x,y). 

(c) If X, y G V, then 

{x,y) = {y,x). 

(d) For every nonzero x in y we have (x, x) > 0. 

Items (a)-(d) say that the inner product is a positive definite conjugate symmetric sesquilinear 
form. When a vector space has been equipped with an inner product we define the NORM (or 
length) of a vector x by 

||x|| := ^/{x^■, 

A Hilbert space is a complete inner product space. (Here, of course, we intend completeness 
of the metric space induced by the norm induced by the inner product.) 

20.6.2. Proposition. If x, y, and z are vectors in an inner product space and a EC, then 

(a) {x,y + z) = {x,y) + {x,z), 

(b) {x,ay) = a{x,y), and 

(c) (x, x) = if and only if x = 0. 

20.6.3. Example. For vectors x = (xi, X2, . . . , x„) and y = {yi,y2, ■ ■ ■ , yn) belonging to define 

n 

{x,y) =^xkyk. 

k=l 

Then C" is an inner product space. 

20.6.4. Example. Let I2 be the set of all square summable sequences of complex numbers. (A 
sequence x = (xfc)^j^ is SQUARE SUMMABLE if YlT=i\^k\'^ < °°-) vectors x = (xi,X2, . . . ) and 
y = (yi) 2/2, ■ • • ) belonging to I2 define 

00 

{x,y) = ^XkVk. 

k=l 

Then I2 is a Hilbert space. (It must be shown, among other things, that the series in the preceding 
definition actually converges.) 

20.6.5. Example. For a < 6 let C([a, 6], C) be the family of all continuous complex valued functions 
on the interval [a, b]. For every f,gE: C{[a, b]) define 



if, 9) = / f{x)g{x)dx. 

J a 

Then C([a, 6]) is an inner product space. 
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20.6.6. Example. Let {S, ji) be a positive measure on space. A measurable function / is SQUARE 
INTEGRABLE if jg\f{x)\^ dii{x) < oo. As we have seen the family of (equivalence classes of) square 
integrable functions on S is denoted by L2(<S'). For every f,g& L2{S) define 

if, 9) = / fix)g{x)dfi{x) . 
Js 

This is an inner product under which L2(-S') becomes a Hilbert space. 

20.6.7. Theorem. In every inner product space the Schwarz inequality 

\{x,y)\ < \\x\\ \\y\\. 

holds for all vectors x and y. 

20.6.8. Proposition. If (xn) is a sequence in an inner product space V which converges to a vector 

a GV, then {xn,y) — > {a,y) for every y £V. 

Every inner product space is a normed linear space. 

20.6.9. Proposition. Let V be an inner product space. For every x & V define 

\\x\\ := . 

Then the rnxip x i— t- ||a;|| is a norm on V . 

20.6.10. Definition. Vectors x and y in an inner product space V are orthogonal (or perpen- 
dicular) if (x, y) = 0. In this case we write x ± y. Subsets A and B oi V are ORTHOGONAL if 
a _L 6 for every a e A and b e B. In this case we write A ± B. 

20.6.11. Definition. If M and A'^ are subspaccs of an inner product space V we use the notation 
V = M (B N to indicate not only that V is the (vector space) direct sum of M and N but also that 
M and A^ are orthogonal. Thus we say that V is the (internal) orthogonal direct SUM of M 
and AT. 

20.6.12. Proposition. Let a be a vector in an inner product space V. Then a J- x for every x eV 
if and only if a = 0. 

20.6.13. Proposition (The Pythagorean theorem). If x J- y in an inner product space, then 

II I Il2 II ||2 , II ||2 

\\x + y\\ = \\x\\ + ||y|| . 

20.6.14. Example. In let M be the X-axis and L be the line whose equation is y = x. If 
we think of as a (real) vector space, then it is correct to write M? = M ® L. If, on the other 
hand, we regard M? as a (real) inner product space, then / M © L (because M and L are not 
perpendicular) . 

20.6.15. Proposition. Let V be an inner product space. The inner product on V, regarded as a 
map from V ®V into C, is continuous. So is the norm, regarded as a map from V into M. 

Concerning the proof of the preceding proposition, notice that the maps {v,v') i— t- \\v\\ + \\v'\\, 
{v,v') + ll'w'lp, and {v,v') max{||i;||, \\v'\\} are all norms on V (BV. Which one is 

induced by the inner product on F © F? Why does it not matter which one we use in proving that 
the inner product is continuous? 

20.6.16. Proposition (The parallelogram law). If x and y are vectors in an inner product space, 
then 

\\x + y\f + \\x - y|p = 2||x|p + 2||y|p . 

20.6.17. Example. Consider the space C([0, 1]) of continuous complex valued functions defined 
on [0, 1]. Under the UNIFORM NORM 

ll/IU :=sup{|/(x)|:0<x<l} 

C([0, 1]) is a normed linear space. There is no inner product on C([0, 1]) which induces this norm. 
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Hint for proof. Use the preceding proposition. 

20.6.18. Proposition (The polarization identity). // x and y are vectors in an inner product 
space, then 

{x,y) = + y|P - \\x - y|p + i ||x + -i\\x - . 

20.6.19. Notation. Let V be an inner product space, x e V, and A, B CV. If x _L a for every 
a e A, we write x -L A; and if a _L 6 for every a e A and b e B, we write A ± B. We define A-^, 
the ORTHOGONAL COMPLEMENT oi A, to he {x E V : X ± A}. We write A-^-^ for (^"'")'''. 

20.6.20. Proposition. If A is a subset of an inner product space V , then A-^ is a closed linear 
subspace of V. 

20.6.21. Theorem (Ricsz-Frechet Theorem). If f E V* where V is an inner product space, then 

there exists a unique vector a in V such that 

f{x) = {x,a) 

for all X in V. 

20.6.22. Convention. In the context of Banach (and, in particular, Hilbert) spaces the word 
"subspace" will always mean closed vector subspace. To indicate that M is a subspace of a Banach 
space B we write M ^ B. A (not necessarily closed) vector subspace of a Banach space is often 

called by other names such as linear subspace or linear manifold. 

20.6.23. Definition. Let ^ be a nonempty subset of a Banach space B. We define the CLOSED 
LINEAR SPAN of A (denoted by Y A) to be the intersection of the family of all subspaces of B which 
contain A. This is frequently referred to as the smallest subspace of B containing A. 

20.6.24. Proposition. The preceding definition makes sense. It is equivalent to defining y A to 
be the closure of the (linear) span of A. 

20.6.25. Theorem (Minimizing Vector Theorem). If C is a nonempty closed convex subset of a 
Hilbert space H and a G C^, then there exists a unique b E C such that \\b — a\\ < \\x — a\\ for every 
xGC. 




C 



20.6.26. Example. The vector space M? under the uniform metric is a Banach space. To see that 
in this space the minimizing vector theorem does not hold take C to be the closed unit ball about 
the origin and a to be the point (2,0). 

20.6.27. Example. The sets 

Ci = |/gC([0,1]): '^-///=i} C2 = |/GLi([0,l]): _^'/ = l} 

are examples that show that neither the existence nor the uniqueness claims of the minimizing 
vector theorem necessarily holds in a Banach space. 
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20.6.28. Theorem (Vector decomposition theorem). Let H be a Hilhert space, M he a subspace 
of H, and x € H. Then there exist unique vectors y £ M and z £ M-^ such that x = y + z. 

20.6.29. Proposition. Let H be a Hilbert space. Then the following hold: 

(a) if M C H, then M C M^^ ; 

(b) ifMCNC H, then C ; 

(c) if M is a subspace of H, then M = M-^-^; and 

(d) ifMCH, then \/ M = M^^ . 

20.6.30. Definition. Let V and W be inner product spaces. Recall (from 8.3.4) that if, for {v, w) 

and {v\ w') in y x and a G C, we define 

(?;, w) + {v , w') = {v + v' ,w + w') 

and 

a{v, w) = {av, aw) 

then V X W becomes a vector space, V © W, which is the (external) direct sum of V and W. To 
make it into an inner product space define 

{{v,w), {v',w')) = {v,v') + {w,w'). 

This makes the direct sum of V and W into an inner product space. It is the (external orthog- 
onal) DIRECT SUM of V and W and is denoted by V ®W. 

Notice that the same notation © is used for both internal and external direct sums and for both 
vector space direct sums (see definition 8.3.4) and orthogonal direct sums. So when we see the 
symbol ^ © it is important to know which category we are in: vector spaces or inner product 
spaces, especially as it is common practice to omit the word "orthogonal" as a modifier to "direct 
sum" even in cases when it is intended. Notice also that, despite the fact that every inner product 
space "is" a normed linear space, the norm on the direct sum V (BW oi two inner product spaces 
is not the same as the norm on y © when regarded as the product of two normed linear spaces 
(see 12.3.2). 

20.6.31. Example. If H and K are Hilbert spaces so is their direct sum H (B K. 

20.6.32. Definition. A mapping T: V ^ W between complex vector spaces is antilinear if 
T{x + y) = Tx + Ty and T{ax) = aTx for all x, y G V and a £ C A bijective antilinear map is 

an ANTI-ISOMORPHISM. 

20.6.33. Example. Every Hilbert space is isometrically anti-isomorphic to its dual space. 
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ALGEBRAS AND LATTICES OF CONTINUOUS FUNCTIONS 

21.1. Banach Lattices 

21.1.1. Definition. A norm || • || on a Riesz space is a lattice norm (or a RiESZ norm) if 
|x| < |y| implies ||x|| < ||y|| for all x, y £ E. A Riesz space equipped with a lattice norm is a 
NORMED Riesz space. A complete normed Riesz space is a Banach lattice. 

21.1.2. Example. Under pointwise operations and partial ordering M" is a Banach lattice for 
every n G N. 

21.1.3. Example. If 21 is an algebra of subsets of a set S, then 5im(S') is a Banach lattice. (See 
example 19.1.5.) 

21.1.4. Example. Let X be a nonempty compact Hausdorff space. Under the usual pointwise 
operations C{X) is a Banach lattice. (See example 20.4.3.) 

21.1.5. Example. Let S* be a nonempty set. The family B{S) of bounded real valued functions on S 
is a Banach lattice (under the usual pointwise operations and uniform norm). (See example 20.4.4.) 

21.1.6. Example. Let X be a nonempty topological space. Under the usual pointwise operations 
Cb{X) is a Banach lattice. (See example 20.4.5.) 

21.1.7. Example. Let X be a nonempty locally compact Hausdorff space. Under the usual 
pointwise operations Cq^X) is a Banach lattice. (See example 20.4.6.) 

21.1.8. Example. If (S, /u) is a positive measure space, then Loo(5') is a Banach lattice. (See 
example 20.4.9.) 

21.1.9. Example. If {S,n) is a positive measure space, then Li(5') is a Banach lattice. (See 
proposition 20.4.10.) 

21.1.10. Example. Let (5, /i) be a positive measure space and 1 < p < oo. Then Lp(S') is a 
Banach lattice. (See example 20.4.17.) 

21.1.11. Example. Let 21 be an algebra of subsets of a set S. Notice that while the norm on 
ba(S') given in example 12.1.18 is not a lattice norm the one given in example 16.5.19 is. Under 
this latter norm ba(S') is a Banach lattice. So too is ca(5) whenever 21 is a cr-algebra of subsets 
of S. (See example 20.4.20 and its proof.) 

21.1.12. Proposition. Every normed Riesz space is Archimedean. 

21.1.13. Proposition. In a normed Riesz space every band is closed. 

21.1.14. Proposition. Let E be a normed Riesz space. Then 

(a) |||x| II = ||x|| for all x £ E, 

(b) the map x x'^ is uniformly continuous, and 

(c) the map x i— )• |x| is uniformly continuous. 

Hint for proof . For (b) show first that — y^\\ < ||x — y\\ and for (c) show |||x| — \y\ || < 

\\x - y\\- 

21.1.15. Proposition. If E is a normed Riesz space, then its (norm) dual E* is a Banach lattice. 
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Hint for proof . Show that E* is an order ideal in . 

21.1.16. Proposition. If E is a Banach lattice, then its norm dual and order dual are identical 
(that is, E* = E^). 

Hint for proof . Show first that the norm dual of any normed Riesz space is an order ideal in 
its order dual. Then argue by contradiction. Let / E E'^ \ E*. Choose unit vectors Xn in E so 
that \f{xn)\ > for every n. Let y„ = X]fc=i ^"^l^fcl- Show that (y„) converges to some element 
b and that < y„ < 6 for every n. Deduce that n < n~^|/(x„)| < |/|(6) for every n. (Or see [2], 
theorem 24.10.) 

21.1.17. Proposition. // Ji and J2 are closed order ideals in a Banach lattice, then Ji + J2 is 
also a closed order ideal. 

Hint for proof . First show that if xi £ Ji, X2 G J2, and \u\ < |xi +X2I, then there exist "^i G Ji 
and V2 G J2 such that u = vi + V2, and \vi\ < \u\ (for i = 1, 2). Next suppose that a sequence (xn) 
in Ji + J2 converges to a point a in the Banach lattice. Without loss of generality we may assume 
that \\xn — iCn-i|| < n~'^. (Why?) For each n let Un = Xn — Xn-i (with xo = 0). Now decompose 
each Un as above. 

21.2. The Stone- Weierstrass Theorems 

We have seen in examples 20.4.4 and 20.4.3 that if X is a compact HausdorfF space, then the 
space C{X) of continuous real valued functions on X is a closed (unital) subalgebra of the Banach 
algebra B{X) of bounded real valued functions on X. Next we look at a normed subalgebra of 
13{X) (and C{X)) that is not closed. 

21.2.1. Example. The family 'P([— 1,1]) of polynomial functions on the interval [—1,1] is a 
subalgebra of the Banach algebra C( [— 1, 1] ) which is not closed. 

Hint for proof . Let s{t) = \t\ for —1 < t < 1. Recursively define a sequence {pn)'^=o of polyno- 
mials on the interval [—1,1] by po{t) = and Pn+i{t) = Pn{t) + 5 [^^ ~ {Pn{t)) ] for — 1 < t < 1 and 
n G Z"*". Use induction to show that 

2\t\ 2 

< \t\-pnit) < ^ , ' ' I < 



2 + n\t\ ~ 2 + n 
for — 1 < t < 1 and n G Conclude that pn ^ s (unif). 

The preceding example shows that the family P([— 1,1]) of polynomials on [—1,1] is not 
closed in ;S([— 1,1]). This leads naturally to the question: What is the closure of P([— 1,1]) 
in B[ [—1, 1] )? We know from proposition 12.1.17 that the uniform limit of a sequence of continu- 
ous functions is continuous, so 'P( [— 1, 1] ) C C( [— 1, 1] ). Since the closure of a normed subalgebra 
is again a subalgebra, we know that 'P( [— 1, 1] ) is a closed subalgebra of C( [— 1, 1] ). That it is, 
in fact, equal to C([— 1,1]) is by no means obvious (but true); this is the classical Weierstrass 
approximation theorem. We will obtain this result as a corollary to the substantially more general 
Stone- Weierstrass theorem. The first (and hardest) step in deriving this celebrated result will be 
to prove a lattice-theoretic version of the theorem (in 21.2.3). 

21.2.2. Definition. Let X be a topological space. We will say that a family of continuous 
functions on X possesses the two-point duplication property with respect to a function 
/ G C{X) if for every pair of points x ^ y in X there exists h £ which agrees with f at x and y. 

21.2.3. Theorem (Stone- Weierstrass theorem: lattice version). Let X be a compact topological 
space. If a sublattice A of C{X) has the two-point duplication property with respect to a function 
f G C{X), then f £A. 



21.3. SEMICONTINUOUS FUNCTIONS 165 

Hint for proof. Given e > we want to find a function h € A sucli tliat ||/ — < e. For every 
pair p and q of distinct points in X choose a function hpq G A which agrees with f at p and q. Let 

U{p,q) = {x e X: hpg{x) < f{x) + e} 

and 

V{p, q) = {xeX: hpq{x) > f{x) - e] . 

For fixed q ^ X find pi, . . . Pm in ^ such that {U (pi,q), . . . U{pm,q)} covers X. Let hg = AjLi ^PjQ 
and V{q) = flJLi ^(PJ^q)- Then find qi, . . . ,qn in X such that {V{qi), . . . , V{qn)} covers X. Let 
h = \ll=i hg,. 

21.2.4. Proposition. Let X be a compact topological space. Every closed subalgebra A of C{X) is 
a sublattice. 

Hint for proof. Show that |/| belongs to A whenever / does. Deal first with the case 
ll/IU < 1- Use example 2L2.L 

21.2.5. Proposition. Let X be a topological space. If A is a separating unital subalgebra of C{X), 
then A has the two-point duplication property with respect to every f G C{X). 

21.2.6. Theorem (Stone- Weierstrass theorem: real version). If X is a compact topological space, 
then every separating unital subalgebra of C{X) is dense in C{X). 

21.2.7. Corollary (Weierstrass approximation theorem). Every continuous function on the closed 
bounded interval [a, 6] can be approximated uniformly by polynomials. (That is, every continuous 
function is the uniform limit of a sequence of polynomials.) 

21.2.8. Proposition. The Banach algebra C( [a, 6] ) is separable (and therefore second countable). 

21.2.9. Proposition. Let / G C( [0, 1] ) . // /J x"/(x) dx = /orn = 0, 1, 2, . . . , thenf = 0. 

21.2.10. Proposition. Let Q be the set of all functions f in C( [0, 1] ) such that f is differentiate 
on (0, 1) and f'{\) = 0. Then Q is dense in C( [0, 1] ). 

21.2.11. Exercise. Let be a closed subset of [0,oo). Give a useful necessary and sufficient 
condition on F for the following to hold: every continuous real valued function can be uniformly 
approximated on F by polynomials in x^. 

/n x'"' f{x) dx 

21.2.12. Exercise. Find lim — — for continuous real valued functions / on [0, 1]. 

/o dx 

21.2.13. Definition. A family of complex valued functions is self-adjoint if it is closed under 
complex conjugation. 

21.2.14. Theorem (Stone- Weierstrass theorem: complex version). If X is a compact topological 

space, then every unital separating self-adjoint subalgebra ofC{X,C) is dense in C{X,C). 

21.2.15. Theorem (Generalized Stone- Weierstrass theorem). If X is a compact topological space 
and A is a closed unital subalgebra ofC{X), then there exists a compact H aus dor ff space Y such 
that A is isometrically isomorphic to C(Y). 

21.3. Semicontinuous Functions 

21.3.1. Definition. A real valued function / on a topological space X is LOWER semicontinuous 
at a point a in X if for every e > there exists a neighborhood U of a such that f{x) > f{a) — e 
whenever x e U. The function / is upper semicontinuous at a if for every e > there exists a 
neighborhood U oi a such that f{x) < f{a) + e whenever x £ U. 
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21.3.2. Proposition. Let f be a real valued function on a topological space X. Then 

(a) / is lower semicontinuous if and only oo,a] is closed for every a E M; 

(b) / is upper semicontinuous if and only if f^ [a, oo) is closed for every a G M; and 

(c) / is lower semicontinuous if and only if —f is upper semicontinuous. 

21.3.3. Remark. It involves only routine computations to show that the sum of two lower semi- 
continuous functions is also lower semicontinuous and that the product of two positive lower semi- 
continuous functions is lower semicontinuous. The preceding sentence remains true if "lower" is 
replaced in each case by "upper" . 

21.3.4. Proposition. A subset A of a topological space X is open if and only if the characteristic 
function Xa "i-^ lower semicontinuous. The set A is closed if and only if Xa upper semicontinuous. 



21.4.1. Definition. A topological space {X,%) is normal if disjoint closed sets can be separated 
by open sets. A T4 space is a normal space which is also Ti. 

21.4.2. Example. The indiscrete topology on a set with at least two elements shows that a normal 
topological space need not even be Ti. 

21.4.3. Proposition. A Hausdorff topological space X is normal if and only if it satisfies the 
following condition: If A and C are subsets of X such that A -< C, then there exists a set B in X 
such that A ^ B ^ C. 

We can improve on proposition 17.3.9. 

21.4.4. Proposition. The following implications hold for topological spaces: 



A plentiful supply of normal topological spaces is guaranteed by the next two results. 

21.4.5. Example. Every compact Hausdorff space is normal. 

Proof. Let A and B be disjoint closed subsets of a compact Hausdorff space X. Fix a €z A; 
then for each b £ B choose disjoint neighborhoods Uh of a and of b. Because B is compact there 
exist 61, . . . , 6„ in i? such that U^^-^VJ,^ ^ B. Notice that U^^-^Vb^ is disjoint from Ci^^iUh^ (which 
is a neighborhood of a). 

The preceding paragraph shows that for each a £ A there exist disjoint open sets Wa containing 
o and Ya containing B. Since A is compact, there exist oi, . . . , S ^ such that U^^^Wafe 2 A. 
Then U^^Wa^ and Pl^-^Y^j. are disjoint open sets containing A and B respectively. □ 

21.4.6. Example. Every metric space is normal. 

21.4.7. Proposition. Let {Aj.) and (Bk) be sequences of subsets of a normal topological space X 
and let S and T be subsets of X . If for every n £ N we have An S ~< Bn and An -< T Q Bn, 
then there exists a closed set F such that An -< F ~< Bn for every n £N. 

Proof. There exists Ci C X such that Ai ^ Ci ^ T [since Ai -< T]. There also exists Di C X 
such that S U Ci ^ Di ^ Bi [ since S ^ Bi and Ci -< Bi]. Proceeding inductively, suppose that 
Ci, . . . ,Cn, Di, . . . , Dn have been chosen in such a fashion that A^ Ck < T and S -< -< B^ 
for k G N„. Now there exists Cn+i ^ X such that 



[since An+i -< T and An+i Q S ~< Dk for k £ N„]. Furthermore, there exists -D^+i ^ X such that 
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Ti^T^i ^Tg ^Ti. 




(21.1) 




(21.2) 
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[since S -< Bn+i and -< T C Bn+i for k € N„+i]. Taking F = U^^C^, we see [using (21.1) 
and (21.2)] that for each n G N 

21.5. The Hahn-Tong-Katetov Theorem 

21.5.1. Theorem (Hahn-Tong-Katetov Theorem). Let X be a normal topological space, g: X 

be upper semicontinuous, and /i: X — )• M be lower semicontinuous with g < h. Then there exists a 

continuous function / : X — t- M such that g < f < h. 

Proof. The folfowing is a somewhat expanded version of the proof found in [41], Theorem 
6.4.4. 

Define functions H : Q ^ ^{X) and G: Q ^ ^{X) by 

H{r) = h^{—oo,r] and G{r) = g'^ {—oo,r). 

Notice that H{r) is closed and G{r) is open for every rational number r. Notice also that if r and 
s are rational numbers such that r < s, then H{r) -< G{s). 

Let t: N — )• Q be an enumeration of Q. Since {H{r): r G Q and r < ti} and {G(s): s G 
Q and s > ti} are countable families of sets and since H{r) C H{ti) -< G{s) and H{r) -< G{ti) C 
G{s) whenever r and s are rational numbers satisfying r < ti < s, we may use proposition 21.4.7 
to assert the existence of a closed set Fi ^ X such that H{r) -< Fi ~< G{s) whenever r and s are 
rational numbers satisfying r < ti < s. 

Proceeding inductively, we suppose that closed sets Fi,. . . , F„ C X have been chosen in such 
a way that 

(i) H{r) -< Fk ~< G{s) whenever k £ N„ and r and s are rational numbers satisfying r < t^ < s; 
and 

(ii) Fj -< Fk whenever j, k £ Nn and tj < tk- 
Now define 

(£" = {Ffc : E N„ and tk < Wi}, 
(£+ = {Fk : A; G N„ and tk > Wi}, 
5 = i/(Wi)u(|Jg-), 

r = G(t„+i)n(f|€+), 

on = U {H{r) : r G Q and r < and 
= ^+ U {G{s) : s G Q and s > t„+i}. 

It is clear that SOT and 5^1 are countable families of sets. A little patience — but no imagination — 
is required to establish that M <^ S < N and M C N whenever M and N e ^. Thus we 
may again make use of proposition 21.4.7 to obtain a closed set Fn+i ^ X such that M -< Fn+i -< N 
whenever M e Tl and iV G 9T. 

It is easy to see that H{r) -< -Fn+i ^ G{s) whenever r and s are rational numbers satisfying 
r < tn+i < s [since r < tn+i implies H{r) G Tl, and s > tn+i implies G{s) G 5^1]. If A; G N„ and 
tk < tn+i, then Fk G CI Tl; so Fk -< Fn+i- If, on the other hand, G N„ and tk > tn+i, then 
Fk G so -< Fk- This completes the induction argument. Thus we conclude that (i) 

and (ii) above, hold for all n G N. 

Now regard F as a function from N into ^{X); that is, i^: N — )• ^{X) : k i— )• Fk- Define a new 
function F: Q — )• ^{X) hy F = F o t~^. In other words F{tk) = Fk for each rational tk- The 
preceding inductive construction gives us two important properties of F: 

(i) H{r) -< F(u) -< G{s) whenever r, u, and s are rational numbers such that r < u < s; and 

(ii) F{r) -< F(^s) whenever r and s are rational numbers such that r < s. 



168 



21. ALGEBRAS AND LATTICES OF CONTINUOUS FUNCTIONS 



Let X € X and r € Q and suppose x ^ Fir)- Then whenever s < r, we have x ^ H{s) = 
h^(—oo, s]. Thus for s < r we have h{x) > s. Therefore 

h{x) > r whenever x ^ F{r). (21-3) 

It follows from (21.3) that every x in X is in at least one F{r). 

Similarly, if x G X and r € Q and x G F(r), then for s > r we have x G G{s) = g*^{—oo, s) and 
thus g{x) < s. Therefore, 

g{x) < r whenever x G F(r). (21-4) 

It follows from (21.4) that every x £ X fails to be a member of at least one F{r). We make one 
more simple observation: 

inf{s G Q: X G F{s)} = sup{r G Q: x ^ F(r)} (21.5) 

Now define a function /(x) = inf{s G Q: x G F(s)}. That dom(/) = X and ran / C M follows 
from (21.3) and (21.4)above. From (21.4) we also obtain /(x) = inf{r G Q: x G F{r)} > g{x). 
From (21.3) and (21.5) we obtain /(x) = sup{r G Q: x ^ F{r)} < h{x). Thus g < f < h. 

It remains to show that /: X — t- M Is continuous. Let a £ X. Given e > choose rational 
numbers r, si and s such that 

/(o) - e < r < f{a) < si < s < f{a) + e. 

Clearly a ^ F{r). Since a G F{si) < F{s), we see that a G (i^(s))°. Let U = {F{s)y \ F{r). 
Then [/ is a neighborhood of a. Suppose that x £ U. Then x G -F(s) and so /(x) = inf{ti G 
Q: X G F{u)} < s. Similarly, x ^ F(r) and so /(x) = sup{n G Q: x ^ F{u)} > r. Therefore, 
/(a) — € < r < /(x) < s < f{a) + e. This shows that / is continuous at a. □ 

21.5.2. Example. Give examples to show that the preceding theorem fails if we replace the 
assumptions that g is upper semicontinuous and h is lower semicontinuous with any of the following: 

(a) g is lower semicontinuous and h is upper semicontinuous; 

(b) both g and h are lower semicontinuous; 

(c) both g and h are upper semicontinuous. 

21.5.3. Theorem (Tietze extension theorem (real version)). If A is a closed subset of a normal 
topological space X and if /o G C{A) is hounded, then there exists a continuous extension f of /o 
to all of X such that sup(ran/) = sup(ran/o) and inf(ran/) = inf(ran/o). 

Tietze's theorem can be generalized to not necessarily bounded functions. 

21.5.4. Theorem. If A is a closed subset of a normal topological space X and if fo G C{A), then 
there exists a continuous extension f of fo to all of X . 

21.5.5. Definition. Let ^ be a subset of a topological space X. A continuous map /: X ^ A is 
a RETRACTION if /(x) = X for all x £ A. Thus a retraction is a continuous surjection from X to 
A which leaves A pointwise fixed. Another way of saying the same thing: / is a retraction (of X 
to A) if it is continuous and if the following diagram commutes. 

X I .A 

A 

When there exists a retraction from X to ^ we say that ^ is a retract of X. 

21.5.6. CAUTION. The preceding is the usual definition of a retraction from a topological space 
to a subspace: thus / is a retraction if the inclusion mapping l of A into X is a right inverse for /. 
More generally, in arbitrary categories the word "retraction" often means "right invertible". And 
"section" and "coretraction" are synonyms for "left invertible". 
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21.5.7. Example. If r > 0, then {zGC:|2;|<r}isa retract of the complex plane C. 

21.5.8. Theorem (Tietze extension theorem (complex version)). If A is a closed subset of a normal 
topological space X and if fo € C{A,C) is bounded, then there exists a (complex valued) continuous 
extension f of fo to all of X such that \\f\\u = ||/olU- 

21.5.9. Theorem (Urysohn's lemma). Any two nonempty disjoint closed subsets of a normal 
topological space can be functionally separated. 

21.5.10. Proposition. Let X be a compact Hausdorff space and f be a positive lower semicontin- 
uous function on X. Then 

/ = sup{<7GC(X): 0<5</}. 

21.5.11. Proposition. Let X be a compact topological space. Then C{X) is separating if and only 
if X is Hausdorff. 

21.5.12. Proposition. Let X and Y be compact Hausdorff spaces and (p: X ^ Y be continuous. 
Define Ccf) on C{Y) by 

C(t>{g) = go4> 

for all geC{Y). 

(a) Then Ccj) maps C{Y) into C{X). 

(b) The pair of maps X i— )• C{X) and (p '-^ Ccf) is a contravariant functor from the category 
of compact Hausdorff spaces and continuous maps to the category of unital commutative 
Banach algebras and continuous unital algebra homomorphisms. 

(c) The homomorphism C(f) is infective if and only if the function (p is surjective. 

(d) The homomorphism C(f) is surjective if and only if the function (j) is infective. 

(e) Suppose X = Y. Then C(j) is idempotent (with respect to composition) if and only if 
(j): X ^ ran (j) is a retraction. 

21.5.13. Corollary. If X and Y are homeomorphic compact Hausdorff spaces, then the Banach 
algebras C{X) and C{Y) are isometrically isomorphic. 

21.5.14. Proposition. Let X be a compact Hausdorff space, F be a closed subset of X , and 
M = {f G C{X) : /(x) = for every x £ F}. Then C{X)/M is isometrically isomorphic to C{F). 

21.6. Ideals in C{X) 

21.6.1. Proposition. If J is a proper ideal in a unital Banach algebra, then so is its closure. 

21.6.2. Corollary. Every maximal ideal in a unital Banach algebra is closed. 

21.6.3. Proposition. Let J be a proper closed ideal in a Banach algebra A. On the quotient 
algebra A/ J (see definition 8.2.12) define a function 

II II : ^/J M: a 1-^ inf{||n|| : u G a} . 

This function is a norm on A/ J , under this norm A/ J is a Banach algebra, and the quotient map 
is continuous with ||7r|| < 1. The Banach algebra A/ J is the QUOTIENT ALGEBRA of A by J. 

21.6.4. Proposition. Let I be a proper closed ideal in a unital commutative Banach algebra A. 
Then I is maximal if and only if A/ 1 is a field. 

21.6.5. Example. For every subset C of a topological space X the set 

Jc:={/eC(X):r(C7) = {0}} 

is an ideal in C{X). Furthermore, Jc 5 Jd whenever C C D C X. (In the following we will 
write Jx for the ideal Js^x}-) 

21.6.6. Proposition. Let X be a compact topological space and I be a proper ideal in C{X). Then 
there exists x £ X such that I C Jx. 
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21.6.7. Proposition. Let x and y be points in a compact Hausdorff space. If Jx ^ Jy, then x = y. 

21.6.8. Proposition. Let X be a compact Hausdorff space. A subset I ofC{X) is a maximal ideal 
in C{X) if and only if I = Jx for some x £ X. 

21.6.9. Corollary. If X is a compact Hausdorff space, then the map x i— )• Jx from X to Max C{X) 
is bijective. 

Compactness is an important ingredient in proposition 21.6.8. 

21.6.10. Example. In the Banach algebra Cb((0,l)) of bounded continuous functions on the 
interval (0,1) there exists a maximal ideal / such that for no point x G (0,1) is / = Jx- Let / 
be a maximal ideal containing the ideal S of all functions / in C;,( (0, 1) ) for which there exists a 
neighborhood Uj of in M such that f{x) = for all x £ C// H (0, 1). 

21.6.11. Proposition. Let X be a compact Hausdorff space, C the family of a;; closed subsets of 
X , and ^ be the family of all closed ideals in C{X). Then the map 

J:C ^Z- C ^ Jc 

is a bisection. 



CHAPTER 22 



FUNCTIONS OF BOUNDED VARIATION 

In these notes a real valued function on a subset j4 of M is increasing if f{x) < f{y) whenever 
X < y and x, y £ A. If we want f(x) < f{y) whenever x < y we will say that / is strictly 
INCREASING. (Of course we adopt a similar convention for the terms decreasing and strictly 
DECREASING.) A function is monotone if it is either increasing or decreasing. And it is strictly 
monotone if it is strictly increasing or strictly decreasing. 

It is not a profound observation that in the vector space of real valued functions on an interval 
[a, b] (with a < b) the set I = I{[a, b]) of increasing functions is a convex cone (see proposition 9.3.2). 
The cone is not proper: the only conclusion one can make from f £ ID (—T) is that / is constant. 
To produce a proper cone the most obvious move is to factor out the constant functions; that is, 
to regard two functions as equivalent if they differ by a constant. Equivalently, consider the family 
Iq = 2o([a, b]) of increasing functions / on [a, b] such that /(a) = 0. Clearly Iq is a proper convex 
cone in the vector space J- = J-{[a,b]) of real valued functions on [a,b]. By proposition 9.3.4 this 
cone induces a partial ordering on which makes J- into an ordered vector space with Iq as its 
positive cone. It is obvious that it does not generate all of since differences of functions in Iq 
would necessarily vanish at a. But we do know (see remark 9.3.9) that it generates some ordered 
vector space contained in J^. Could it be Tq, the set of all functions on [a,b] which vanish at a? 
Can every real valued function be expressed as the difference of two increasing functions? The 
answer to this turns out to be no. If a functions wriggles about too much it may be impossible to 
decompose it into the difference of increasing functions. Which functions then belong to Iq — 2o? 
Answer: the functions of bounded variation for which /(a) = 0. We investigate these functions in 
this chapter. 

There is more at stake here than simply answering the question just posed. The family of 
functions of bounded variation turns out to be quite important. In particular, as we shall see later 
in this chapter, it is a natural domain on which Riemann-Stieltjes integrals operate. We also show 
in this chapter that the family Iq — Xq is a Riesz space. It is an especially interesting Riesz space 
because the ordering on the space is not pointwise: a function / is "less than" a function g if (and 
only if) the function g — f \s increasing. 

22.1. Preliminaries on Monotone Functions 

22.1.1. Definition. Let / be a real valued function on an interval J in M. (We are not committing 
ourselves to J's being closed or open or bounded or unbounded.) Suppose that c G J and that 
(c, oo) n J / 0. This says that c is either an interior point of J or, in case J is bounded below, 
a left-hand endpoint of J. (The second supposition merely prohibits c from being a right-hand 
endpoint, for then (c, oo) n J would be empty.) In this situation we say that a number /(c+) is 
the RIGHT-HAND LIMIT of / at c if for every e > there exists (5 > such that \ f{x) — /(c+)| < e 
whenever c < x < c + 5. 

A real valued function / defined on an interval J in M is right-CONTINUOUS at a point c if c 
belongs to J = dom/ and if for every e > there exists 5 > such that \f{x) — /(c) | < e whenever 
X £ (c, c + 5) n J. 

22.1.2. CAUTION. There is a not-very-interesting technical detail that should be noted about 
the terms just defined. As stated the definition does not imply that being right-continuous at a 
point and having a right-hand limit there are necessarily related. Take for example the identity 
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function x i— )• x on the interval (0, 1]. It has a right-hand hmit at but is not right-continuous 
there (because it is not defined at 0). On the other hand it is right-continuous at 1 (notice that no 
matter what 6 is chosen, the set (1, 1 + 6) D (0, 1] is empty) but has no right-hand hmit there. 

22.1.3. Exercise. Give a definition analogous to the preceding one for the left-hand limit f{c—) 
of a function / at a point c and for the term left- continuous. 

It follows from remark 11.2.2 that a real valued function on an interval is continuous at a point 
in its domain if and only if it is both left- and right-continuous there. 

22.1.4. Proposition. Let f : [a,b] — )• M be an increasing function. At each c G {a,b] the function 
f has a left-hand limit and f{c—) < f{c). At each c G [a,b) the function f has a right-hand limit 
and f{c+)>f{c). 

22.1.5. Proposition. Let f : [a, 6] — t- M be an increasing function. If a < c < d < b, then 
f{c+)<f{d-). 

22.1.6. Corollary. An increasing function on an interval in M has at most countably many dis- 
continuities. 

22.2. Variation 

22.2.1. Convention. Let / be a real valued function defined on the interval [a, b]. Suppose that 
P = {to)^i) • • • )^n} is an ordered (n + l)-tuple of points in [a, 6] such that to = a, tn = b, and 
tfc-i < tk for all k E N„. The (n + l)-tuple P determines a partition of [a, b] which comprises the 
n subintervals [toi^i]) [^15*2]) • • • , [tn-i,tn]- Since the correspondence between such (n + l)-tuples 
of points in [a, b] and partitions of [a, b] into subintervals is bijective we will abuse language by 
referring to the the set P as a partition of [a, b]. 

22.2.2. Definition. Let P = (sq, si, • • • , Sm) and Q = {to,ti, . . . ,tn) be partitions of the inter- 
val [a, b]. Write P ^ Q if the set associated with P is contained in the set associated with Q, that 
is if {sq, Si, ... , Sm} ^ {toi ill • • • ,tn}. In this case we say that Q is a refinement of P. 

22.2.3. Proposition. The set of all partitions of the interval [a,b] is a directed set under the 
relation =4 defined above. If P and Q are partitions of [a, b] then their least upper bound P V Q is 
called the least common refinement o/P and Q. 

22.2.4. Notation. Suppose that / is a fixed real valued function on [a, b] where a < b. For every 
partition P = {tQ, ti, . . . , tn} of [a, b] define 

n 

Wf{P) :=J2\f{t,)-f{tk-i)\. 

k=l 

22.2.5. Proposition. If f is a real valued function on the interval [a,b], then the function Wj 
defined above is an increasing net of real numbers whose domain is the family of partitions of [a, b]. 

22.2.6. Definition. Let / be a real valued function on [a,b]. If the net Wf defined in 22.2.4 is 
bounded, we say that the function / is of bounded variation. According to proposition 16.4.19 
every bounded increasing net of real numbers converges. So if / is a function of bounded variation 
on [a, 6], the corresponding net Wf must converge. Its limit is denoted by V^f and is called the 
TOTAL variation of / over [a,b]. It will be convenient to define the total variation of / over the 
one point interval {a} to be 0; that is, V^f = 0. The family of all real valued functions of bounded 
variation on [a, b] is denoted by BV{[a, b]). 

11.1.1 . Proposition. Every increasing function f on a closed bounded interval [a, b] is of bounded 
variation and V^f = f{b) — f{a). 
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22.2.8. Proposition. Let a < c < b and f : [a, b] — )• M. Then f is of bounded variation on [a, b] if 
and only if it of bounded variation on both [a, c] and [c, b] . Furthermore, if f is of bounded variation 
on [a,b], thenV^f = V^ + V^f. 

22.2.9. Example. Find the total variation of the following functions. 

(a) X I— 7- sin a; on the interval [0, 27r]. 

(b) x^\x^ - - + 15x^ on the interval [-2, 2]. 

22.2.10. Example. A continuous function, even a differentiable function, on a closed and bounded 
interval need not be of bounded variation. For example, consider f{x) = x'^ sin — ^ for < x < 1 
and /(O) = 0. ^ 

22.2.11. Example. A function of bounded variation need not be piecewise monotone. (A function 
/ is PIECEWISE MONOTONE on an interval if there exists a partition (to, • • • , in) of the interval such 

TT 

that / is monotone on each subinterval [tk-i,tk]-) Consider f{x) = sin — for < x < 1 and 
/(O) = 0. 2^ 

22.3. The Variation Norm 

On the family BV{[a, b]) of functions of bounded variation the function / i— )• Vl^{f) is a seminorm. 
We can make this into a norm, which we will call the variation norm, by the simple expedient of 
adding |/(a)| to the total variation Vll{f) of /. Every function of bounded variation on an interval is 
bounded. So as a subspace of B{[a, b\) the set of functions of bounded variation have another norm, 
the uniform norm (see example 12.1.9), which turns out to be always smaller than the variation 
norm. We formalize this in the next proposition. 

22.3.1. Proposition. Let a < b. Then BV{[a,b]) is a vector subspace of B{[a,b]). For every 
f £BVi[a,b]) define 

11/11^ :=|/(a)|+Kf(/). 

The function f i— t- is a norm on BV{[a, b]), which is called the VARIATION NORM. Furthermore, 

for every function f of bounded variation on [a, b] 

\\f\\u<\\f\\y. 

22.3.2. Example. Under the usual pointwise ordering ;BV([a,6]) (where a < 6) is a Riesz space. 
The variation norm has an interesting multiplicative property. 

22.3.3. Proposition. /// and g are functions of bounded variation on the interval [a, 6], then the 
product fg is of bounded variation and 

\\f9\\y<\\f\\u\\g\\y + \\f\\y\\g\\u. 

Thus BV{[a,b]) is an algebra of functions. 

It is fairly clear that if a function / gets arbitrarily close to zero, then its reciprocal 1 // becomes 
unbounded and consequently is not of bounded variation. If, however, the function / is of bounded 
variation and stays some fixed (but perhaps very small) distance from zero, then its reciprocal is 
itself of bounded variation. 

22.3.4. Definition. A real valued function / on a set S is bounded away from zero if there 
exists 6 > such that |/(x)| > 6 for all x £ S. 

22.3.5. Proposition. // a function f is of bounded variation and is bounded away from zero on 
an interval [a,b], then its reciprocal 1/f is also of bounded variation. In fact, if |/(x)| > 6 > for 
a < X < b, then 

V 
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22.3.6. Definition. For a function / of bounded variation on the interval [a, b] define 

vf. [a,b] ^R: x^V^ . 
This function is the VARIATION FUNCTION of /. 

22.3.7. Proposition. If f is a function of bounded variation on [a,b], then its associated variation 
function Vf is right- continuous (or left- continuous, or continuous) at a point c in [a,b] if and only 
if f is. 

22.3.8. Proposition. // / is a function of bounded variation on [a,b], then the function vj + f 
and Vf — f are increasing. 

22.3.9. Corollary. Every function of bounded variation on an interval [a,b] is the difference of 
two increasing functions. 

22.3.10. Proposition. If g and h are increasing functions on [a, b], then so is g -\-h — Vg-h- 
Proposition 22.1.4and corollary 22.3.9 and make the following proposition clear. 

22.3.11. Proposition. // / is a function of bounded variation on the interval [a,b], then it has 
both a left-hand and a right-hand limit at each point of (a, 6). The limits /(«+) and f(b—) also 
exist. 

Corollaries 22.1.6 and 22.3.9 justify the following result. 

22.3.12. Proposition. A function of bounded variation has at most countably many discontinu- 
ities. 

22.4. Another Ordering on i3V([a, 6]) 

We have already seen (in example 22.3.2) that using the family of positive functions on an 
interval [a, b] as a positive cone makes the vector space BV{[a, b]) into a Riesz space. But recall that 
at the beginning of this chapter we set out to do something quite different (and more interesting) . 
We sought to use the family of increasing functions on [a, b] as a positive cone. We observed that 
in order to make the cone proper, we would need to work with the increasing functions that vanish 
at a. In this section we pursue the promised construction. 

22.4.1. Notation. Let Xo([a, 6]) be the family of all increasing functions on the interval [a, 6] 
such that /(a) = 0, and let BVo{[a.,b]) be the family of all functions of bounded variation on the 
interval [a, b] with /(a) = 0. 

22.4.2. Example. If a < 6, then BVo([a,^]) (under the induced algebraic operations, pointwise 
ordering, and variation norm) is a normed linear space and a Riesz space. Notice that the variation 
norm is quite simple for such functions: ||/||^ = (/)• 

22.4.3. Proposition. The family Xo([a, 6]) is a proper convex cone in the vector space BV[a, b]. 

By virtue of the preceding proposition and proposition 9.3.4 there exists a partial ordering =^ 
on SVo([a, ^]) which is compatible with the vector space structure. (We use ^ to distinguish this 
partial ordering from the usual pointwise ordering <.) According to proposition 9.3.4 we know that 
f ^ g \i and only \i g — f is increasing. Thus by taking Xo([a, b]) as the positive cone we have made 
;BVo([a, b]) into an ordered vector space. Is it in fact a Riesz space? The work we have done above 
shows that the answer is yes. 

22.4.4. Example. Under the partial ordering ^ defined above BVo{[a,b\) is a Riesz space whose 
positive cone is the set Xo([a, 6]) of increasing functions on [a, 6] that vanish at a. 

Hint for proof . By proposition 14.1.2 it is enough to show that / V exists for every / E 
i3V([o, b]). Prove that / V = \{vf + f) for every such /. 
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22.5. The Fundamental Theorem of Calculus 

The basic facts concerning differentiable functions between normed linear spaces have, as we 
saw in chapter 13, rather uncomphcated proofs. It may be then somewhat surprising to discover, 
even in the simple case of real valued functions of a real variable, how technically demanding 
matters become when we examine the relationship between differentiation and Lebesgue integration. 
Eventually we determine conditions under which the equation f{x) = f{a) + f dX holds for all 
points X in some interval [a, b] . We know from beginning calculus that a sufficient condition for 
this to hold is that / be differentiable with continuous derivative. But surely this condition is not 
necessary. (Consider the absolute value function on [—1, 1].) Before attempting to find a condition 
that is both necessary and sufficient, let's look at an example which shows how badly things can 
go wrong. 

22.5.1. Example. Recall from example 17.1.31 that the Cantor function (let's call it ^) maps the 
Cantor set C into [0, 1] by taking the point X^fcli ^fc^"'^ (each being or 2) to YlT=i ak'^^^^^^^ ■ 
Recall also that if (a, b) is one of the open intervals removed during the construction of the Cantor 
set, then tp takes on the same value at a and at b. Extend '0 to all of [0, 1] by making it constant 
on the closure of each such interval. That is, set ^p{x) = 'ip{a){= ip{b)) whenever a < x < b. More 
formally define 'ip{x) = sup{ip{c) : c ^ C and c < x} for all x £ [0, 1] \ C. We will call this extension 
of the Cantor function the Lebesgue singular function. (Many authors call both functions the 
Cantor function.) The Lebesgue singular function is increasing and continuous. It is differentiable, 
with derivative zero, almost everywhere so that 

V'(l) - V(0) = 1^0=1 i^dX. 

Jo 

22.5.2. Definition. A function /: [a,6][— M is singular if it is monotone, continuous, non- 
constant, and has zero derivative almost everywhere. The definition of this term appears not to 
be well-standardized. In [10], for example, we find "A function satisfying /' = a.e. is called 
singular." Of course this allows the constant functions to be singular, which seems a bit odd. 

22.5.3. Example. It is an easy result that differentiable functions are continuous. While everyone 
knows that the converse is false, it may come as a surprise how badly it fails. In [ 3] (theorem 
17.7) you can find the details of the construction of a bounded continuous function on the real line 
that has a derivative at no point whatever. 

Even worse: it turns out that, in some appropriate sense, most continuous functions on the real 
line are nowhere differentiable. A full explanation and proof of the assertion are given in theorem 
17.8 of [23]. (Since the proof depends crucially on the Baire category theorem you may want to 
postpone reading this until you are familiar with the material discussed in section 29.3.) 

Monotone functions (continuous or not) are another matter altogether. They must be differ- 
entiable almost everywhere. Working through proofs of this fact can be a surprisingly complicated 
undertaking. Be prepared to learn about such technical matters as Vitali coverings and Dini 
derivates. 

22.5.4. Theorem. An increasing real valued function on the interval [a,b] is differentiable almost 
everywhere. 

Proof. See [2], theorem 29.9; [4], theorem 2.3.9; [10], theorem 20.6; [19], theorem 222A; [23], 
theorem 17.12; ['^T:], theorem 6.2; and [3R], chapter5, theorem 2. 

22.5.5. Corollary. A function of bounded variation on the interval [a, b] is differentiable almost 
everywhere. 

Proof. Proposition 22.3.9. □ 

22.5.6. Proposition. Let f G Ci{[a,b]) where a < b. Define F(x) = f dX for every x £ [a,b]. 
Then F is a continuous function of bounded variation on [a, b] with V^F = /^^|/| dX. 
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Proof. See [10], theorem 20.9 and [32], proposition 6.2. 

The first fundamental theorem of calculus says roughly that for integrable functions differenti- 
ation is almost everywhere a left inverse of indefinite integration. 

22.5.7. Theorem (Fundamental Theorem of Calculus I). Let f G Ci{[a,b]) where a < b. Define 
F{x) = f dX for every x £ [a, b]. Then F is dijjerentiable almost everywhere on [a, b] and F' = f 
a.e. 

Proof. See [10], theorem 20.9; [ 19], theorem 222E; and [3: ], theorem 6.5. 

While it is certainly gratifying to know that the indefinite integral F in theorem 22.5.7 is 
differentiable almost everywhere, we must realize that such a result gives us no information whatever 
about what happens at any particular point in the domain of /. So 22.5.7 should certainly not 
be regarded as a replacement for the much more elementary result from beginning calculus that 
guarantees differentiability of F at points of continuity of its integrand /. 

22.5.8. Proposition. Let f £ Ci{[a,b]) where a < b. Define F(x) = f dX for every x £ [o,b]. 
Then F is differentiable at every point of {a,b) where f is continuous and at such points F' = f . 

22.5.9. Proposition. Every absolutely continuous function on the interval [a,b] is continuous and 
of bounded variation. 

Proof. See [ju], proposition 20.15 and ["''^], theorems 18.12 and 18.13. 

22.5.10. Corollary. Every absolutely continuous function on the interval [a,b] is differentiable 
almost everywhere. 

It is often useful to know that many of the results in this section are also true for complex 
valued functions. The following proposition is the key to making possible such extensions. 

22.5.11. Proposition. If f is a complex valued absolutely continuous function on the interval 
[a,b], then there exist real valued, increasing, absolutely continuous functions fi, fi, fz, and f^, 
such that / = /i - /2 + i(/3 - 74). 

Proof. See [23], theorem 18.13. 

22.5.12. Proposition. An absolutely continuous function defined on an interval in M which has 
zero derivative almost everywhere is constant. 

Proof. See [JO], theorem 20.16 and [/ ], theorem 18.15. 

22.5.13. Example. The Lebesgue singular function is not absolutely continuous. 

22.5.14. Definition. A function / is an indefinite integral on an interval [a, b] if there exists 
an integrable function g on [a, b] such that f{x) = f{a) + gdX for all x € [a, b]. 

22.5.15. Theorem. A function f is an indefinite integral on an interval if and only if it is abso- 
lutely continuous on the interval. 

Proof. See [,>^], propositions 6.6-6.7 and [36], chapter 5, theorem 13. 

The second fundamental theorem of calculus, which says that any absolutely continuous function 
is the indefinite integral of its derivative, is an easy consequence of theorem 22.5.15. 

22.5.16. Corollary (Fundamental theorem of calculus II) . /// is an absolutely continuous function 
on [a,b], then its derivative (exists a.e. and) is integrable on [a,b] and 



/(^) - /(«) 




f'dX 



for all X G [a, b]. 
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22.5.17. Example. The family H of all absolutely continuous complex valued functions / on [0, 1] 
satisfying /(O) = and /' G >C2([0, 1]) is a Hilbert space when we define an inner product on H by 
setting 

Jo 

whenever / and g belong to H. 

Hint for proof. Show first that the map {f,g) i— s- (/, (?) is well-defined and is an inner product. 
Then to show that H is complete note that if (/„) is a Cauchy sequence in H, then the sequence 
(/^) is Cauchy and therefore convergent to some function g in L2([0, 1]). Show that g is integrable 
and let h be its indefinite integral x ^ g dX. Finally verify that h belongs to H and is the limit 
of the sequence (/n)- 

22.5.18. Proposition. Let f be a function of bounded variation on the interval [a,b]. Then 
feCi{[a,b]) andj;;\f'\dX<V^f. 

Hint for proof. We have already seen in corollary 22.5.5 that since / is of bounded variation 
/' exists almost everywhere in [a,b]. Suppose for the moment that / is increasing and has been 
extended to [a,oo) by setting f{x) = f{b) for all x > b. For each n G N and x G [a,b] let 
gn{x) := n[f{x + n~^) — f{x)]. Since f'{x) = linin^oo dnix) almost everywhere, it is clear that /' 
is positive and measurable. Use Fatou's lemma 19.2.10 to show that 



' < f{b) - f{a). 



/ /'dA<liminfn / f dX - f dX 

Ja Ub Ja 



Next assume only that / is of bounded variation (not necessarily increasing) . Use the preceding 
argument to show that J^\f'\ dX < J^v'^ dX < V^f. □ 

22.6. The Riemann-Stieltjes Integral 

22.6.1. Definition. Let J = [a,b] be a fixed interval in the real line, x = {xq,xi, . . . ,Xn) be an 
(n-|- l)-tuple of points of J, and t = (ti, . . . , t„) be an n-tuple. The pair (x; t) is a partition with 
SELECTION of the interval J if: 

(a) Xk-i < Xfc for 1 < A; < n; 

(b) tk G [xk^i,Xk] for 1 < k <n; 

(c) xq = a; and 

(d) Xn = b. 

The family of all partitions with selections of J = [a, b] is denoted by V{J) or Via, b]. 

22.6.2. Notation. If x = (xq, xi, . . . , then {x} denotes the set {xq, xi, . . . , 

22.6.3. Definition. Let P = (x; s) and Q = (y; t) be partitions (with selections) of the interval J. 
We write P =4 Q and say that Q is a refinement of P if {x} C {y}. 

22.6.4. Proposition. Under the relation =4 defined above V{J) is a directed set. 

22.6.5. Definition. Let / and a be bounded functions on the interval J = [a, b] and P = (x; t) 
be a partition of J into n subintervals. Let Aa^ := a{xk) — a(a;fc_i) for 1 < A; < n. Then define 

n 
k=l 

A sum Sf^P) where P G P(J) is a Riemann-Stieltjes sum of / with respect to a on J. Notice 
that since P(J) is a directed set, Sj^a is a net of real numbers. 

22.6.6. Example. Let f{x) = 3x-2 and a{x) = for < x < 4, x = (0,1,2,3,4), t = (1,2,2,3), 
and P = (x, t). Calculate Sf^a{P)- 
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22.6.7. Definition. Let / and a be bounded functions on the interval J = [a, b]. If the net Sf^a of 
Riemann-Stieltjes sums converges we say that the function / is Riemann-Stieltjes integrable 
with respect to a. We denote by Tl{a) the set of all bounded functions / which are Riemann- 
Stieltjes integrable with respect to a. 

The limit of the net Sf^a (when it exists) is the Riemann-Stieltjes integral of / with 
respect to a over the interval J and is denoted by 

rb f-b 

f da or / f{x)da{x). 



The function / is the integrand and a is the integrator. Notice that i the special case where 
a{x) = X on J, the integral is just the Riemann integral (see example 16.4.11). 

22.6.8. Example. Let / be a bounded real valued function on the interval [0, 2] which is continuous 

{0, if < X < 1 
~ ~ . Prove that / is Riemann-Stieltjes integrable with respect 
1, if 1 < X < 2 

to a on [0, 2] and calculate f da. 

22.6.9. Exercise. Using only the definition of the Riemann-Stieltjes integral prove that 

rb 

da{x) = a{b) — a{a) . 



22.6.10. Proposition. If f € Tl{a) and f G '^(/3) on [a,b] and c G M, then f belongs to both 
Tl{a + /?) and to Tl{ca) and 

/•b j-b pb j-b j-b 

/ fd{a + f3)= / fda+ / f d/S and / f d{ca) = c f da . 

J a J a J a J a J a 

22.6.11. Exercise. Prove that if / G Tl{a) on [a, 6], then a G TZif) on [a,b] and 

I pb 

f da + / a df = f{b)a{b) — f{a)a{a) . 



For a fixed integrator function a the family TZ{a) of Riemann-Stieltjes integrable functions is a 
vector space. 

22.6.12. Proposition. // /, g £ TZ{a) on [a, b] and c G M, then f + g and cf also belong to TZ{a) 
and 

/•b rb j-b j-b rb 

/ {f + g) da = f da + g da and / (c/) da = c f da . 

J a J a J a J a J a 

22.6.13. Proposition (Change of Variables) . Suppose that f G TZ{a) on the interval [a,b]. Also 
suppose that g is a strictly increasing function on the interval [c,d] with g{c) = a and g[d) = b. 
Define h = f o g and (5 = ao g. Then h G Tl^jS) on [c, d] and 

rb I'd 

fda= hd/3. 



22.6.14. Proposition (Reduction to Riemann Integral). Let a be a continuously differentiable 
function on the interval [a, 6]. If f £ TZ{a) on [a,b], then the Riemann integral fa' exists and 

b r-b 

f{x)da{x)= I f{x)a'{x)dx. 



Hint for proof . Let g = fa' . Try to find partitions P = (x, t) of [a, b] which make the distance 
between the Riemann sums Sg[P) = '^g{tk)Axk and the Riemann-Stieltjes sums Sf^a{P) = 
^ f{tk) Aofc small. Apply the mean value theorem (see, for example, [ ], theorem 8.4.26) to a' on 
each subinterval [xfc_i, x^] to find points Uk such that Aak = o:'{uk) Axfc. Also use the boundedness 
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of / and the uniform continuity of a' to make \a'{uk) — a'{tk)\ < e/K (for an appropriate K) 
whenever Uj^ and he in the same subinterval of P. 

22.6.15. Definition. Let P = (x, t) be a partition of [a, 6]. Define 

MkU) ■■= sup{/(a:;) : Xk-i < x < Xk} . 

The Riemann-Stieltjes sums 

n 

Uf,a{P) :=^Mfc(/)Aafc 

fc=i 

are the upper RS-SUMS of / with respect to a on [a, b]. Similarly, if 

mfc(/) := inf{/(x) : Xk-i < x < Xk} 
then the Riemann-Stieltjes sums 

n 

Lf^a{P) :=Y.mk{f)Aak 
k=l 

are the LOWER RS-SUMS of / with respect to a on [a, b]. 

22.6.16. Notation. If J is an interval in M we write "/ t on J" to mean that the function / is 
(defined and) increasing on J. 

22.6.17. Proposition. Let a f on [a,b] and P, Q E V[a,b] with P 4Q- Then 

Uf,a (P) > Uf^a (Q) and L (P) < Lf^aiQ) ■ 

22.6.18. Proposition. Let a t on [a,b] and P, Q E V[a,b]. Then 

22.6.19. Definition. Let a t on the interval [a,b]. The UPPER Darboux-Stieltjes INTEGRAL 
of / with respect to a is defined by 

ffda := mf{Uf,„{P) : P G 7^(P)[a,6]}. 

J a 

Similarly, the LOWER Darboux-Stieltjes INTEGRAL of / with respect to a is defined by 

b 

fda := sup{L/,«(P) : P G 7^(P)[a,6]}. 

For simplicity we sometimes write /(/, a) for J^f da and /(/, a) for J^f da. 

22.6.20. Proposition. Ifa'l on the interval [a,b], then I_{f,a) < I{f,a). 

22.6.21. Example. In the preceding proposition the quantities I_{f,a) and I(f,a) may differ. 

22.6.22. Definition. A function / is said to satisfy the RiEMANN CONDITION with respect to a 

on [a, b] if for every e > there exists Pq £ TZ{P)[a, b] such that 

0<Uf,aiP)-Lf,aiP)<e 

whenever Pq ^ P. 

22.6.23. Proposition, //at on [a,b], then the following are equivalent: 

(a) / G n{R){a) on [a,b]; 

(b) / satisfies the Riemann condition with respect to a on [a, 6]; and 

(c) Iif,a) = I{f,a). 



/ 

J a 
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22.6.24. Proposition (First Mean Value Theorem). If a'f and f € TZ{a) on J = [a, b], then there 
exists a number k in [inf /(J), sup /( J)] such that 



If, in addition, f is continuous on J, there exists c E J such that k = f{c). 

22.6.25. Proposition (Second Mean Value Theorem). If a is continuous and f on [a,b], then 
there exists c G [a, b] such that 



22.6.26. Proposition. Suppose a < c<b. Then a function f belongs to TZ{a) on [a,b] if and only 
if it belongs to Tl{a) on both [a,c] and [c,b], and that in this case 



22.6.27. Proposition. If f is a continuous function on the interval [a,b] and a ^ on [a,b], then 
f G TZ{a) on [a, b] . 

22.6.28. Proposition. If f is a continuous function on the interval [a, b] and a is of bounded 
variation on [a, b], then f G TZ{a) on [a, b]. The same conclusion holds if f is of bounded variation 
and a is continuous. 





fda = f{a)[a{c) - a{a)] + f{b)[a{b) - a{c)] . 




f{x) da{x) = / f{x) da{x) + / f{x) da{x) . 





CHAPTER 23 



PROBABILITY 



Probability theory is a branch of measure theory. As a body mathematics it is straightforward, 
entirely clear, and enjoyable. Its applications to real world events, to the world of statistics are open 
to all manner of controversy. The fundamental principles governing the applications of probability 
theory are subject to endless controversy and are, in my opinion, poorly understood. This short 
chapter is nothing more than a brief introduction to the language of probability theory together 
with a few words of caution concerning some of the more thoughtless aspects of its applications. 

23.1. The Language of Probability 

Although, mathematically, probability theory is a branch of measure theory, some aspects of 
it may seem unfamiliar at first because the language it uses is quite different from the language of 

measure theory, and so is its notation. 

23.1.1. Definition. A PROBABILITY SPACE is a positive measure space (p., 21, P) such that P(f^) = 
1. The set Q is called a sample space, its elements are called outcomes, and its measurable 
subsets, that is, members of 2t are called EVENTS. The measure P is a PROBABILITY MEASURE. 
Events A and B are MUTUALLY EXCLUSIVE if they are disjoint. More generally, the events in a 
family *B C 21 are PAIRWISE mutually exclusive if every pair of distinct members of B are 
mutually exclusive. The family 03 of events is EXHAUSTIVE if their union is all of Q. Events A and 
B are INDEPENDENT if P{A nB) = P{A)P{B). 

23.1.2. Example. A discrete probability space is a probability space ($7,21, P) where the 
sample space Q = {coi: i € /} is countable and 21 = ^($7). The probabilities pi = P{{uJi}) of the 
individual outcomes determine the probabilities of all events in the obvious way 

23.1.3. Example. A special case of the preceding example is the subject of much classical prob- 
ability theory. The subject of study is a finite probability space in which each outcome is equally 
likely; that is, Q = {ui, . . . ,0Jn} and P{{ui}) = ^ for each i G N„. Thus for each event A 

. card A 
= caxdJ7 • 

Clearly problems in this classical probability setting make no use of measure theory. Everything is 
reduced to counting elements of finite sets. 

23.1.4. Example. One frequently encounters phrases such as. 

The probability of rolling any given number from 1 to 6 with a fair die is g. (*) 
Here, of course, we have left the world of mathematics for the world of applications and its attendant 
ambiguities. Although almost anyone would tend to agree with this assertion, it is not easy to 
explain exactly what has been said. What, we might ask, is a fair die? Sometimes it is defined to be 
a perfect cube made of perfectly homogeneous material with the numbers 1 through 6 painted on its 
sides. Since perfection is somewhat beyond the capacities of human engineering, we might enquire 
under what conditions a slightly imperfect die would be regarded for experimental (or gambling) 
purposes as fair. One fairly common response is to require the frequencies of the occurrence of 
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each integer 1 through 6 on a "large" number of rolls to be "roughly" equal. This clearly changes 
the definition of "fair" and makes (*) into a tautology. It also leaves open the questions of how, 
in practice, to interpret the words "large" and "roughly". This approach to probability is quite 
common and is called the relative-frequency interpretation of probability. It certainly cannot be the 
whole story. It is, for example, surely not a sensible way to interpret the assurance from a utility 
company that the risk of a meltdown of a nuclear power station they are building in your town is 
less than one in one hundred million. 

23.1.5. Exercise. Explain how to set up a sample space to model a single throw of a pair of fair 
dice. What is the probability of throwing a pair of 6's? What is the probability of throwing a 7? 
What is the probability of throwing at least 7? 

23.1.6. Exercise. If a word is drawn at random from the statement of this exercise, what is the 
probability it has at least three letters? 

In case you are inclined to ask, while reading the preceding exercise, what the expression "at 
random" means, I must confess that I don't know. In the context of this particular exercise I 
would take it as an instruction to assign probability ^ to the outcome of choosing any particular 
occurrence of any word in the sentence. Thus the probability of choosing the word "drawn" is ^ 
while the probability of choosing the word "at" is Jj. A great deal has been written, much of it quite 
technical, by mathematicians, statisticians, philosophers, computer scientists, and others trying to 
tie down the meaning of the word "random". For an entertaining, informative, but nontechnical 
introduction to the subject I recommend the small book Randomness [ ] by Deborah J. Bennett. 

23.2. Conditional Probability 

23.2.1. Definition. Let A and B be events in a probability space {i^,P) with P{A) > 0. The 

CONDITIONAL PROBABILITY P{B\A) OF THE EVENT B GIVEN A is defined by 

^ ' ' P{A) 

What could be simpler? The (mathematical) definition of conditional probability is entirely 
straightforward. Applying the definition to concrete cases is another matter. Probability is often 
regarded as a measure of conviction. The degree of one's initial conviction concerning the occurrence 
of an event B is the probability P{B) in the absence of information. (This is usually called the prior 
probability of the event.) On the basis of additional information (say the occurrence of event A) it 
is natural for one to alter one's degree of conviction to match the conditional probability P{B\A). 
(This is the posterior probability of B given A.) There is however a serious problem here, one that 
is seldom addressed adequately in statistics texts. What exactly does the "given A" really mean? 
That A has occurred? That you know it has occurred? Or that you know in some special way that 
it has occurred? Pay special attention to exercises 23.2.9 and 23.2.10 to see how treacherous failure 
to be precise about this matter can be. It is difficult to find serious discussions of this matter. 
The most clear-headed treatment I know is Keith Hutchinson's article [25] in The British Journal 
for the Philosophy of Science. Many special cases have received a great deal of attention. (For an 
entertaining example search for "Monty Hall problem" in the Wikipedia [ ].) 

23.2.2. Proposition. // A and B are independent events in a probability space with P{A) > 0, 
then 

P{B\A) = P{B) . 

23.2.3. Proposition. If {Bj} is a countable family of mutually exclusive and exhaustive events in 
a probability space (fi, P) and A is an event in Q, then 



P{A) = Y,P{B,)P{A\B, 



3 



where the sum is taken over all j such that P{Bj) > 0. 
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23.2.4. Proposition. If A, B, and C are events of strictly positive probability (in some probability 
space), then 

P{A U B\C) = P{A\C) + P{B\C) - P{A n B\C) . 

23.2.5. Proposition. If A, B, andC are events of strictly positive probability (in some probability 
space), then 

P{A n B\C) = P{A\C) P{B\A n C) . 

23.2.6. Exercise (Bayes Box Paradox). A chest has three drawers. The first drawer contains 
2 gold coins; the second contains 1 gold and 1 silver coin, and the third contains 2 silver coins. 
Randomly you choose a drawer and a coin from that drawer. It is gold. What is the probability 
that the remaining coin in the drawer you have chosen is gold? 

23.2.7. Proposition (Bayes' Law). Let A and B be events in a probability space (i^,-P) with 
P(^) > 0. Then 

^^^'^^ P{A) ■ 

23.2.8. Exercise. Big Brother Corporation decides to do its part in making the community in 
which it is located a safer and more law-abiding place. It is going to accomplish this by testing all 
its employees (except of course for top management) for methamphetamine (meth) use. It makes 
use of an excellent new test which produces a positive result 99% of the time for meth users and a 
negative result 99% of the time for those who do not use meth. According to recent Government 
statistics approximately 731,000 Americans are current meth users, which is roughly 0.3% of the 
population over the age of 12. Assume that this is the percentage of meth users working for Big 
Brother. 

(a) A Big Brother employee tests positive for meth use. Given the remarkable accuracy of the 
test, what would you guess is the probability that this employee is a meth user? 

(b) Do the math. Use Bayes' law to calculate the probability that the employee is a meth 
user. 

23.2.9. Exercise. (a) What is the probability that when two (fair) dice are rolled that the 

sum of the faces showing is 7? 

(b) At a gambling table a croupier, whom you know to be honest, shakes two dice in a cup, 
puts the cup down on the table, and peeks under it. He says, "One of the dice is a two" . 
What is the conditional probability that the sum of the two dice is 7 given the information 
you have received that one of them is a two? 

(c) A few moments later the croupier again rolls the dice, looks under the cup and says, "one 
of the dice is a . . . " . Someone coughs, causing you to miss the last word he said. You 
conclude that he must have specified some number between 1 and 6. In each of the six 
cases what would you conclude about the conditional probability of getting a 7? Does this 
result seem strange? 

23.2.10. Exercise. Try to find a correct approach (and answer) to each of the following questions. 

Alternatively, give an argument that the question is ill-defined. {Assume for this exercise that the 
probabilities of a child's being a boy and being a girl are equal.) 

(a) You know that Anastasia is the mother of two children of different ages. One day you 
observe Anastasia walking with a young boy. Someone tells you (correctly) that this is the 
older of her children. What is the probability that both of Anastasia's children are boys? 

(b) You know that Anastasia is the mother of two children of different ages. One day you 
observe Anastasia walking with a young boy. Someone tells you (correctly) that this is 
one of her children. What is the probability that both of Anastasia's children are boys? 

(c) You know that Anastasia is the mother of two children of different ages. Someone tells 
you (correctly) that the older of her children is a boy. What is the probability that both 
of Anastasia's children are boys? 
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(d) You know that Anastasia is the mother of two children of different ages. Someone tells 
you (correctly) that at least one of her children is a boy. What is the probability that 
both of Anastasia's children are boys? 

(e) You know that Anastasia is the mother of two children of different ages. Someone tells you 
(correctly) that at least one of her children is a boy named Fred. What is the probability 
that both of Anastasia's children are boys? 

Several versions of the preceding exercise are discussed (rather lightly) in the last chapter of [5] . 
In the same chapter the author discusses a number of other fascinating "paradoxes" of probability. 
For a deeper discussion see [25]. 

23.2.11. Exercise. In a particular region of the United States a study is conducted by a large 
tobacco company in which they find, to their delight, that the lung cancer mortality rate among 
cigarette smokers is actually lower than among nonsmokers. A public interest group does the same 
study among the urban population of the region and finds that the smokers there have a higher 
lung cancer mortality rate than the nonsmokers. A second public interest group does the study for 
the rural (that is, non-urban) population of the region and also finds the smokers to be at a higher 
risk of dying of lung cancer than the nonsmokers. Can you conclude from this that someone here 
is mistaken — or lying? Explain. 



In probability and statistics one encounters random variables. The terminology is odd since 
they are neither random nor variables. 

23.3.1. Definition. A real valued function on a probability space {^l,P) is called a random 
VARIABLE if it is Borel measurable. 

23.3.2. Notation. If X is a random variable on a probability space (il, P) and B is a Borel subset 
of M, then it is usual to write the event {a; G il: X(uj) £ B} as X G B. Similarly, if a € M, then 
X < a denotes the event {u £ Q: X{uj) < a}. 

23.3.3. Definition. Let X be a random variable on a probability space (17, P). The probability 

DISTRIBUTION OF X (or the PROBABILITY MEASURE INDUCED BY X) is the function 

Px : *Bor(M) ^ [0,1]: B ^ P{X € B). 
And the distribution function of the random variable X is the function 



23.3.4. Proposition. In the preceding definition the function Px is indeed a probability measure 
on the measurable space (M, *Bor(M)). 

23.3.5. Proposition. // two random variables on a probability space have the same distribution 
functions, then they induce the same probability measure on M. 

23.3.6. Definition. Let X be a random variable on a probability space {fl, P). The expectation 
(or expected value) of X is defined by 



Jn 

23.3.7. Proposition. Let X be a random variable on a probability space {^,P) and 51: M — )• M be 

a Borel measurable function. Then 
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Fx : M ^ [0, 1] : X P{X < x). 
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23.3.8. Definition. Let X be a random variable on a probability space (il,P). The VARIANCE of 
X is defined by 

Var(X) := E{{X - E{X) f) . 

If Var(X) < oo, then X is of FINITE VARIANCE and cr{X) := i/Var(X) is called the STANDARD 
DEVIATION of X. 

23.3.9. Proposition. If X is a random variable on a probability space, then 

\ar{X) = E{X^) - {E{X)Y . 
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PRODUCT MEASURES AND ITERATED INTEGRALS 

24.1. Product Measures 

In this chapter we discuss the product and the coproduct of a pair of measurable spaces. On 
the product of two u-finite positive measure spaces we introduce a "product measure". Then 
we examine the (somewhat fussy) relationship between integration with respect to this product 
measure and the usual "iterated integration" encountered in beginning calculus courses. 

24.1.1. Definition. Let (5, 21) and (T, 53) be measurable spaces. If ^ G 2t and 5 G 53, then AxB 
is a MEASURABLE RECTANGLE in S* X T. Let 21 (g) ?B be the cj-algebra generated by the family of all 
measurable rectangles in S xT. A common notation for 21 (8> 53 is (the very misleading) 21 x 53. The 
measurable space (5 x T, 21(g) 53) is the product space of the measurable spaces (5, 21) and (T, 53). 
When it appears convenient to avoid explicit mention of the a-algebra 2t (g 53 one may choose to 
write C 5 X T for e 21 g) 53. 

24.1.2. Example. 53or(M) (g 53or(]R) = 53or(M2). 

24.1.3. Example. The product of measurable spaces as defined in24.1.1 (together with the two ob- 
vious projection functions) is in fact a product in the category of measurable spaces and measurable 
maps (see 8.3.1). 

Hint for proof . In 24.1.1 (S x T, 2l(g 53) is a measurable space by definition. It is easy to verify 
that the usual projection functions vri and 7r2 from S x T to S and T, respectively, are morphisms 
(that is, are measurable functions). Let (U, €) be an arbitrary measurable space. It should be clear 
that there is only one function h mapping U to S x T for which / = vri o /i and g = 7r2 o h. The 
only part of the proof that requires some thought is showing that /i is a morphism. It is a routine 
calculation to prove that h*^{R) belongs to €. whenever ii is a measurable rectangle in S* x T. Use 
proposition 15.1.9 to prove that this is enough to establish the measurability of h. 

24.1.4. Exercise. Identify the coproduct of two measurable spaces (if it exists). Hint. Recall 
example 8.4.4. 

24.1.5. Proposition. If {S,%) and (T, *B) are measurable spaces, then the family of finite (pair- 
wise) disjoint unions of measurable rectangles is an algebra of subsets of S xT. 

Hint for proof . Instead of using the definition given in 5.6.11 for an algebra of sets show that 
the family in question is closed under (finite) intersections and taking of complements. 

m 

24.1.6. Notation. Let S and T be measurable spaces and E C S x T. For each x £ S we define 
:= {y £ T: {x,y) £ E}; this is a VERTICAL SECTION of E. Similarly, For each y £ T we define 

Ey := {x £ S : (x, y) £ E}; this is a HORIZONTAL SECTION of E. 

If/: SxT— T-Risa measurable function, then for each x £ S we define 

-f:T^R:y^ f{x,y) 

and for each y £ T we define 

fy-. S ^R: x^ f{x,y). 
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24.1.7. Proposition. Let S and T be measurable spaces. 

(a) If E (1 S X T , then for every x € S the section is a measurable subset of T and for 
every y & T the section E^ is a measurable subset of S. 

(b) // / : S" X T — 7- M is measurable, then so are the functions ^f and f^ for x £ S and y G T. 

Hint for proof . For the first claim (tiiat eacli ^E is measurable) let 

e = {E C S xT-.^E for every x e S} 
and use the good sets principle. 

24.1.8. Proposition. Let (S", 21, ^) and (T, !B,z^) be a-finite positive measure spaces. For each set 
£; G 21 (g) 53 define 

: 5 ^ [0, oo] : X ^ iy{^E) 

and 

gj^:T^[0,^]:y^fi{Ey). 
Then f^ is ^-measurable, is ^-measurable, and 

j^f^dfi = j^g^du. (24.1) 

Hint for proof . Notice first that for each x £ S we can write v{^E) as J^, u) dviu)- 

Let ^ be the family of all sets E' in 21 (8) 5S for which (24.1) holds. Show that a subset Eq of 
S X T belongs to 2; if it satisfies any of the following four conditions: 

(i) it is a measurable rectangle; 

(ii) it is the limit of an increasing sequence of sets belonging to (B; 

(iii) it is the union of a countable disjoint family of sets belonging to (B; or 

(iv) it is the limit of a decreasing sequence of sets belonging to G; the first of which is contained 
in a measurable rectangle whose sides have finite measure. 

By hypothesis S = \ST=i '^'c ^^"^ ^ ~ Wfc^i where each 5^ and has finite measure. (To 
accommodate the finite case we allow some of these sets to be empty.) For each j, k G N and 
E £ £ let Ejk = En{Sj xTk). Then let Tl be the family of ah sets e 21 O 55 such that Ejk 
belongs to (S for every j and k. Deduce from (ii) and (iv) that 9Jt is a monotone class. From (i) 
and (iii) conclude that the family $H of finite disjoint unions of measurable rectangles is contained 
in Use propositions 24.1.5 and 15.2.9 to show that 21 (8" 55 = 9Jt. Then a simple application of 
(iii) shows that 21 O 55 = (£. 

24.1.9. Proposition. Let {S, 21, ^) and (T, 55, u) be a-finite positive measure spaces. On the family 
of finite disjoint unions of measurable rectangles in S xT, define a function p by 

n 

p{R) = Y,i^{M)y{Bk) 

k=l 

where R = ^ is a finite disjoint union of measurable rectangles Aj. x B^. Then p 

extends to a positive measure on cj(9^). 

Hint for proof . For E G 2l(8)!B let p{E) = vi^^E) dp{x) or, equivalently (by proposition 24.1.8), 
p{E) = Jrp p{Ey) dv{y). Use proposition 19.2.8 to establish countable additivity. 

24.1.10. Definition. The measure p produced in the preceding proposition is the product mea- 
sure on the measurable space SxT. It is frequently denoted hy pxv. The space (SxT, 21(8)55, /ixz^) 

is the PRODUCT MEASURER SPACE. 

24.1.11. Example. If p is counting measure on N, then p x p is counting measure on N x N. 
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24.2. Complete Measure Spaces 

It is natural to find comfort in a result such as proposition 15.3.15, which tells us that the 
pointwise limit of a sequence of measurable functions is measurable. Our sense of well-being may, 
however, be somewhat diminished when it occurs to us that in practice pointwise convergence 
of sequences of functions in measure spaces is not all that common. Much more frequently we 
encounter convergence almost everywhere. Well then, does the convergence almost everywhere of a 
sequence of measurable functions to a function g imply that g is measurable? Unfortunately not. 
We give a simple example of how this failure may occur in 24.2.3. In preparation for the example 
we need to know that not every subset of the Cantor set is a Borel set. 

24.2.1. Example. Let C be the Cantor set and / be the Lebesgue singular function (see exam- 
ple 22.5.1). The measure space we are considering is ([0, 1], *Bor([0, 1]), A), where A is Lebesgue 
measure restricted to the Borel subsets of [0, 1]. Since we are restricting our attention to the unit 
interval, we denote by the complement of the Cantor set in [0, 1]. On the function / is locally 
constant. (A function is locally constant on a set if every point of the set has a neighborhood 
on which the function is constant.) It follows from this that A(/~^(C)) = 1. 

Let g{x) = x + f{x) for all x G [0, 1]. Then g is a homeomorphism between [0, 1] and [0, 2]. (To 
prove that g~^ is continuous, notice that g takes open intervals to open intervals.) From this we 
deduce that g~^{C) has Lebesgue measure one. Thus by example 18.3.10 g~^{C) has a subset, call 
it F, which is not Lebesgue measurable. Let E = g'^(F). Then E is a subset of the Cantor set. If 
it were a Borel set, then F too would be a Borel set and would therefore be Lebesgue measurable. 
So is a subset of the Cantor set which is not a Borel set. 

24.2.2. Example. The set E in the preceding example shows that a subset of a set of measure 
zero need not be measurable. 

24.2.3. Example. As in example 24.2.1 we consider the positive measure space ([0, 1], 55or([0, 1]), A), 
where A is Lebesgue measure restricted to the Borel subsets of [0, 1]. For each n € N let be the 
function identically zero on [0, 1]. Let S be a subset of the Cantor set which is not a Borel set 
(see 24.2.1) and g = x^- Then each is (Borel) measurable, fn ^ 9 (A-a.e.), but g is not (Borel) 
measurable. 

If we contemplate the source of the unpleasantness in the preceding example we soon see that 
what our intuition tells us is to ignore sets of measure zero. They don't matter. But this is not 
going to work well if sets of measure zero can have nonmeasurable subsets. This, basically, is the 
whole problem. Fortunately its solution is fairly straightforward. 

Among the badly overworked terms in mathematics is "complete". In these notes we have 
already encountered metric completeness, order completeness, Dedekind completeness, a-Dedekind 
completeness, and product completeness (of categories). And shortly we will discussing complete 
orthonormal sets. Well, here is one more. 

24.2.4. Definition. A positive measure space {S, 21, fj,) is complete if every subset of a measurable 
set of measure zero is itself measurable; that is, if G 21 whenever E Q A, A £ and n{A) = 0. 

24.2.5. Example. The positive measure space (M, 93Ta)^) (where A is Lebesgue measure and dJlx 
is the (T-algebra of Lebesgue measurable subsets of M) is complete. (See example 18.3.2.) 

24.2.6. Example. The positive measure space ([0, 1], ^or([0, 1]), A), where A is Lebesgue measure 
restricted to the Borel subsets of [0, 1] is not complete. (See example 24.2.2.) 

24.2.7. Example. The product of two complete positive measure spaces need not be complete. 
Consider the complete positive measure space comprising the real line and Lebesgue measure on 
the Lebesgue measurable subsets of M. The product of this space with itself is not complete. 

Hint for proof . Consider the set A x {0} C M x M, where ^4 is a nonmeasurable subset of M. 
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In these notes we have discussed Lebesgue measure, which generalizes the notion of length, 
on M. Of course, it is important to have corresponding positive measures, also called Lebesgue 
measures, on higher dimensional Euclidean spaces which generalize the notions of area, volume, 
etc. Many texts develop Lebesgue n-dimensional measure in detail — see, for example, [2], section 
15; [ ], Chapter 1, sections 3-4; [37], sections 2.19-2.21; or [44], chanpter V, section 1. 

It turns out that for each n E N Lebesgue n-dimensional measure produces a complete positive 
measure space on the cr-algebra of Lebesgue measurable subsets of M". It is worth noting that (as 
we discovered in example 24.2.7) if A is Lebesgue measure on M, then the product measure space 
with measure A x A on is not complete. So Lebesgue measure on is certainly not A x A. 

The good news here is that every positive measure space can be embedded in a slightly larger 
such space (called its completion), and, further, that Lebesgue measure on is just the completion 
of the product space whose measure is A x A (see proposition 24.2.11 below). The fundamental idea 
of the completion of a measure space is to throw in the sets that are missing — all the subsets of 
measurable sets of measure zero. Of course, we must do that in such a way that we end up with a 
(7-algebra of measurable sets. Here is how it is done. 

24.2.8. Notation. Let {S, 21, /x) be a positive measure space. Define a new family 21 of sets as 
follows: a subset A of 5 belongs to 21 if and only if there exist sets E, F £ ^ such that 

(i) E CACF and 

(ii) ti{F\E) = 0. 

Also define /I: 21 — >• [0, oo] by 

JliA) = f,iE) = fiiF). 

24.2.9. Proposition. With notation as above {S, 21, /x) is a complete positive measure space. 

Hint for proof . Don't forget to show that Jl is well-defined. In particular, it must be shown that 
the value J1{A) does not depend on the choice of the particular sets E and F satisfying conditions 
(i) and (ii). (If you get stuck, you can find a detailed and readable proof in ['. ' ], proposition 1.5.1.) 

24.2.10. Definition. With notation as above the complete positive measure space (S, 21, /I) is the 

COMPLETION of the space (S*, 21, //). 

24.2.11. Proposition. For each p £ N let Ap be Lebesgue measure on and dJlp be the family 
of Lebesgue measurable subsets ofW. Then (M'^"*"'^, QJlp+g, Ap+g) is the completion of the space 
(MP+«,9Jtp(»9JTg,Ap X Ag). 

Proof. See [37], theorem 8.11. 

24.3. The Theorems of Fubini and Tonelli 

When a mathematician claims in a proof to be making use of Fubini's theorem (s)he may 
be referring to any one of a dozen or so closely related theorems about changing the order of 
integration. Below are two typical versions of these theorems. Try using proposition 24.1.8 to 
prove them. Almost any good text on Measure Theory or Real Variables will have proofs of these 
(or very similar) results. 

24.3.1. Theorem (Tonelli). Let {S,fj,) and {T,v) be a-finite positive measure spaces. If f is a 
positive fj, X v-measurable function, then the functions 

g: S ^ [Q,oo\: X ^ / ^f dv and /i : T [0, oo] : y i-^ / d/i 
Jt Js 



are measurable and 



fd{^i X v) 
SxT 



T 



f{x,y) du{y) 



dfi{x) 



s 



f{x,y) dfi{x) 



du{y) 
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24.3.2. Theorem (Fubini). Let {S,fi) and {T,u) be a-finite positive measure spaces. If f £ 
Li(/i X v), then ^f £ Li(i^) for almost all x £ S, f^ € Li(/u) for almost all y € T, and the functions 

g: S ^ [0,oo]: X i-^ / ^f dv and h: T ^ [0,oo]: y i-^ / f^ d/j. 

Jt Js 

(defined almost everywhere) belong, respectively, to Li(/x) and Li^v). Furthermore, 



/ fd{iixv)= I I f{x,y)dv{y) dix{x) = / / f{x,y) 
JsxT Js Ut J Jt Us 



diJ,{x) 



dv{y) 



2 2 

24.3.3. Exercise. Let fix, y) = , ^„ ~ for (x, y) / (0, 0) and /(0, 0) = 0. Compute the 

(x^ + y'^'Y 

integrals [Jq /(x, y) dx] dy and [jj^ /(x, y) dy] dx. How is this related to the theorems of Fubini 
and Tonelli? Try to explain clearly what is going on here by converting the integral to polar 
coordinates. 

24.3.4. Exercise. What does Fubini's theorem say about the measure spaces (N,^(N),/i) and 
(N,*P(N),i^) where /x and v are both counting measure? What does Tonelli's theorem say in this 
case? 

24.3.5. Proposition. Let (S, jj) and (T, v) be a-finite measure spaces and the functions f-.S^M. 
and g: T —^M. be fi-integrable and u-integrable, respectively. Then the function 

f^g-.SxT^R: {s,t) ^ f{s)g{t) 

is n X u-integrable and 

J f®gd{iixu)=(^jfdiJ^-(^jgdv 

SxT S T 

X — v 

24.3.6. Example. Define / on (0,1) x (0,1) by f{x,y) = r^. Then / does not belong to 

the space >Ci((0, 1) X (0,1)). \.x + y) 

24.3.7. Exercise. Let pL be counting measure on [0, 1] and A be Lebesgue measure on [0, 1]. Define 

/: [0,1] X [0,1] — 5- R by letting /(x,y) = 1 on the line y = x (0 < x < 1) and elsewhere. 
Calculate /J /(x, y) dfi{x) dX{y) and Jq Jq f{x, y) dX{y) dp{x). Explain why your result does not 
contradict the Fubini and Tonelli theorems. 

24.3.8. Proposition. For a function f defined on M let F be defined on by F{x,y) = f{x — y). 
If f is measurable, then so is F. 

24.3.9. Proposition. For f, ge Ci{M.) define 

/oo 
f{x-y)g{y)dy (1) 
-oo 

whenever the function y i— )■ /(x — y)g{y) belongs to £i(M). Then h(x) is defined and finite for 
almost all x. Set h{x) = whenever (1) is undefined. Then h G i2i(M) and \\h\\\ < \\f\\i \\g\\i- The 
function h is usually denoted by f * g; this is the CONVOLUTION of f and g. 

24.3.10. Example. The Banach space Li(M) becomes a commutative Banach algebra when con- 
volution is used as multiplication. (Why is it alright to let [/] * [g] be [/ * g"]?) 

24.3.11. Exercise. Let t > 0. Use the Fubini- Tonelli theorems to show that 

ft „• ™ rco / ft 



/ dx = ( e sin X dx] dy . 

Jo X Jq \Jo J 



Take the limit as i ^ oo to derive the formula 



/ 

Jo 



°° sinx , TT 



dx 



X 2 
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24.3.12. Exercise. Let ^ be a measurable subset of [a, b] and h> 0. Show that 

rb 



1 f 

— I \{A r\{x-h,x + h)) dx < X{A) . 
s an integral. 

24.3.13. Exercise. Evaluate e^^ dx dy. (Justify each step carefully.) 



2h 

Hint. Express the integrand as an integral. 

-1 rl 



CHAPTER 25 



SOME REPRESENTATION THEOREMS 



25.1. The Radon-Nikodym Theorem 



25.1.1. Theorem (Radon-Nikodym). Let {S,'Qi,fj,) be a a-finite positive measure space and v be a 
complex (or real, or a-finite positive) measure on 21 such that v <^ fi. Then there exists a unique 
f E Li{S,iJ,) such that 



for every yl G 21. 

Proof. See [2], theorem 27.6; [11], theorems 4.2.2 and 4.2.3; [23], theorems (19.23), (19.24), 
and (19.36); [ V..-], theorem 6.10; and [ ], chapter 11, proposition 23. 

The preceding theorem is often referred to as the Lebesgue-Radon-Nikodym theorem. It is not 
difficult to extend it to other cases. Here for example is the theorem when both /i and are real 
measures. 

25.1.2. Theorem (Radon-Nikodym). Let (5, 21, /i) be a real measure space, let v be another real 
measure on 21, and suppose that z/ <C /x. Then there exists a unique f G Li(5, /i) such that 



for every yl G 21. 

25.1.3. Definition. The function / in the preceding two theorems is called the Radon-Nikodym 

DERIVATIVE of 1^ with respect to and is often denoted by — . One may also write dv = f d^. 

dji 

25.1.4. Theorem (Lebesgue Decomposition). Let (X, 21, /i) be a a-finite positive measure space 
and V be a complex (or real, or a-finite positive) measure on 21. Then there exist unique measures 
Va o.nd on 2t such that 

(1) Vs -L ^, 

(2) Va ^ and 

(3) V = Va + Vs- 

Proof. See [2], theorem 27.7; [11], theorem 4.7.1; theorem (19.42); [ ], theorem 6.13; 
and [ ], chapter 11, proposition 24. 

John von Neumann used Hilbert space techniques to give a lovely proof of the Radon-Nikodym 
theorem which at the same time proves the Lebesgue decomposition theorem. A proof along these 
lines can be found in [37], theorem 6.10. 

25.1.5. Proposition. Let /i, v, and p be real measures on a measurable space S. If p <^ fi and 

v <^ p, then p + v <^ p and 





d{p + u) dp du 



(p-a.e.) 
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25.1.6. Proposition (Chain rule). Let ^, v, and p he real measures on a measurable space S. If 
p <^ u and u <^ fi, then p <^ fj. and 

dp dp dv 
dp du dp 

25.1.7. Proposition. Let p and v he real measures on a measurable space S. If p and u p, 

then 

du dp 

TpT. = ^ ^^"-"-'-^ 

25.2. Regular Borel Measures 

25.2.1. Definition. Let X be a locally compact Hausdorff space and (X, 21, p) be a measure space. 
We say that /u is a Borel measure if 21 contains the family !Bor(X) of all Borel subsets of X. 

25.2.2. Definition. Let be a positive real Borel measure on a locally compact Hausdorff 
space X and E (1 X. The measure p is inner regular on E if p{E) = sup{/i(K): K C 
E and K is compact } . We say that p is INNER regular if it is inner regular on every Borel 

subset of X. The measure p is outer regular on E if p{E) = mi{p(U): E U <Z X]. We 
say that p is outer regular if it is outer regular on every Borel subset of X. A positive real 
measure is regular if it is both inner and outer regular. A real Borel measure is regular if both 
/i"*" and ^~are and a complex Borel measure is regular if both its real and imaginary parts are. 
A positive Borel measure is regular if it is inner and outer regular and satisfies the additional 
condition that p{K) < oo for every compact set K X. 

25.2.3. Proposition. A real or complex Borel measure on a compact Hausdorff space is inner 
regular if and only if it is outer regular. 

25.2.4. Proposition. A real or complex measure on a locally compact Hausdorff space X is regular 
if and only if for every Borel set B in X and for every e > there exist an open subset U and a 
compact subset K of X such that K C B C U and \p\{U \ K) < e. 

25.2.5. Notation. If X is a compact Hausdorff space we denote by M(X) or by rca(X) the family 
of all regular real measures defined on the Borel subsets of X. 

25.2.6. Proposition. If X is a compact Hausdorff space, then M{X) is a Riesz space. 
Hint for proof . Show that M(X) is an order ideal in ca(X). 

25.2.7. Proposition. If X is a compact Hausdorff space, then M{X) is a normed linear space. 
(See example 16.5.19.) 

25.2.8. Proposition. Let (S, 21, p) be a measure space, (T, *B) be a measurable space, andcp: S — )• T 
be a measurable function. Then the function M(j){p) := po (p*^ is a measure on T. 

25.2.9. Proposition. Let tp: X be a continuous function between compact Hausdorff spaces. 
Define 

M(j): M{X) M{Y) : p ^ p o c/)^ . 

Then p o (p^ £ M(Y) and M is a covariant functor from the category CpH of compact Hausdorff 
spaces and continuous maps into the category NLSi of normed linear spaces and contractive linear 
maps. 

25.2.10. Proposition. Let (f): X be a continuous function between compact Hausdorff spaces. 
As above define 

M(P: M{X) M{Y) : p ^ p o cj)^ . 

Then M is a covariant functor from the category CpH of compact Hausdorff spaces and continuous 
maps into the category RSP of Riesz spaces and positive maps. 
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25.2.11. Theorem (Change of variables). Let (5", /x) be a real (or complex) measure space, (T, 53) 
be a measurable space, and (p: S ^ T be a measurable function. If u = M(/){fj,) is defined as in 
proposition 25.2.8), then 



f o(/)dfi= / fdu 
IS JT 

for every measurable function f : T ^ M for which either of the integrals exists. 

This is by no means the whole story concerning changing variables. For a deep and meticulous 
account see Fremlin [ ], volume II, sections 235 and 263. 

25.3. Natural Transformations 

25.3.1. Definition. Let A and B be categories and F, G: A — )• B be covariant functors. A 
NATURAL TRANSFORMATION from F to G is a map r which assigns to each object ^ in A a 
morphism ta- F{A) — t- G{A) in B in such a way that for every morphism a: A ^ A' in A the 
following diagram commutes. 

F{A) F{A') 
G{A) > G{A') 

G(a) 

We denote such a transformation by r : F ^ G. (The definition of a natural transformation between 
two contravariant functors should be obvious: just reverse the horizontal arrows in the preceding 
diagram. 

A natural transformation r : F — t- G is a natural equivalence if each morphism ta is an 
isomorphism in B. 

25.3.2. Example. Let V he & finite dimensional vector space and x € V. Define 

x: ^R: f ^ f{x) . 

Recall that V'^ is the algebraic dual of V , that is, the vector space of all linear functionals on V . 
The second dual (yt)t ig denoted by ytt. liT:V is a linear map between finite dimensional 

vector spaces, then its algebraic dual is defined as in 15.4.6: if € W^^, then T\g) is the composite 
linear map gT. And of course denotes (T"!")^ The second dual functor (•)''"''" is the pair 
of maps V I— >■ V'^\ T i— t- T'^'^ from VEC into itself. Because the dual functor is contravariant, the 
second dual functor is covariant. 

On the category VEC of vector spaces and linear maps let / be the identity functor and ( • )^^ 
be the second dual functor. The natural map t, where (for every vector space V) 

Ty-.V^ y"'"'' : x^x 

is a natural transformation from I to ( • )^^. It is in fact a natural equivalence. 

25.3.3. Example. Recall that if i? is a Banach space its dual space B* is the set of all continuous 
linear functionals on B. On B* addition and scalar multiplication are defined pointwise. The norm 
is the usual operator norm: ii f £ B* , then 

11/11 := inf{Af > 0: \ f{x)\ < M\\x\\ for ah x £ B} = sup{|/(x)| : < 1}. 

Under the metric induced by this norm the dual space B* is complete and thus is itself a Banach 
space. If T: B — )• C is a continuous linear map between Banach spaces, T* : C* — )• B* is defined 
as for vector spaces: T*{g) = g oT for every g € C* . The map T* , called the adjoint of T, is a 
continuous linear map. The dual B** of the dual B* of a Banach space B is the second dual of B 
and r** := (T*)* maps B** into C**. If S is a Banach space, then the map t^: B ^ B** : x ^ x 
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where x**{f) = f{x) for all f G B* is called the natural injection of B into B**. (It is not 
obvious that this mapping is injective. We will show that it is in proposition 29.1.12.) 

On the category BANqo of Banach spaces and bounded linear transformations let / be the 
identity functor and ( • )** be the second dual functor. Then the natural injection r is a natural 
transformation from I to (•)**. 

25.3.4. Example. Explain exactly what is meant when people say that the isomorphism 

C{X^Y)^C{X) xC{Y) 

is a natural equivalence. The categories involved are CpH (compact Hausdorff spaces and contin- 
uous maps) and BANqo (Banach spaces and bounded linear transformations). 

25.3.5. Example. On the category CpH of compact Hausdorff spaces and continuous maps let 
/ be the identity functor and let C* be the functor which takes a compact Hausdorff space X to 
the Banach space {C{X))* and takes a continuous function cj): X ^ Y between compact Hausdorff 
spaces to the contractive linear transformation C*</> = (C(j))*. Define the evaluation map E by 

Ex: X ^C*{X): Ex{x) 

where {Ex{x)){f) = f{x) for all / G C{X). 

(a) Notice that this definition does make sense. If x £ X, then Ex{x) really does belong 
to C*{X). 

(b) Show that the map E makes the following diagram commute 



X 



Y 



Ex 



Ey 



Y 1 

C*{X) ^C*{Y) 

C*{<P) 

The preceding example may seem a bit disappointing. It seems very much as if we are at- 
tempting to establish a natural transformation between the identity functor (on CpH) and the C* 
functor. The problem, of course, is that the two functors have different target categories. This is 
something we will fix in chapter 30. The evaluation map Ex : X — )• C* (X) is certainly not sur- 
jective. Notice, for example, that every functional of the form Ex{x) preserves multiplication (as 
well as addition and scalar multiplication). A consequence of this is that such functionals all live 
on the unit sphere of C*(X) (see proposition 30.2.7). With a new topology (the so-called weak-star 
topology — see 29.2.5) on C*{X) the range of Ex turns out to be a compact Hausdorff space which 
is naturally equivalent to X itself. Even more remarkably, we show that the range of Ex can be 
regarded as the maximal ideal space of the algebra C{X). See theorem 30.2.23. 

25.3.6. Example. Let M be the functor defined in 25.2.9 (or the one defined in 25.2.10) and C* 

be the functor defined in 25.3.5. Define the Radon map G : M — )• C* as follows: for every compact 
Hausdorff space X let 

Qx : m{X)^C*{X): ^l^(t>^ 
where (t>^{f) = J-y f dfj, for every / G C{X). Then the Radon map O is a natural transformation. 



25.4. The Riesz Representation Theorems 

25.4.1. Theorem (Riesz representation I). Let {S,n) be a positive measure space, 1 < p < oo, 
and q be the conjugate exponent of p. If cf) is a bounded linear functional on Lp(//) then there exists 
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a unique g E Lq(/u) such that 

Js 

for all f G Lp(/i). Additionally, this result holds for p = 1 provided that the positive measure space 
S is a -finite. 

Proof. See [•^], theorems 27.9 and 27.10; [32], theorem 9.12; and [ ], theorem 6.16. 

25.4.2. Corollary. If {S, fi) is a a-finite positive measure space, 1 < p < co, and q is the conjugate 
exponent ofp, then (Lp(//))* is isometrically isomorphic to Lq(/i). 

25.4.3. Theorem (Riesz representation II). Let X be a compact Hausdorff space. If (p is a positive 
linear functional on C{X), then there exists a unique positive regular Borel measure fj, on X such 
that 

Jx 

for all f € C{X). 

Proof. See [4], theorem 4.2.10. 

It would be pleasant if theorem 25.4.3 were true for locally compact Hausdorff spaces as well. 
And it is — almost. You can find an example of a (fairly pathological) locally compact Hausdorff 
space for which it fails in [37], chapter 2, exercise 17. One solution to the problem, the one we 
invoke here, is to require a little less of the representing measure than regularity. If, in the definition 
of "regularity" , we restrict the demand of inner regularity to the open sets in the space (rather than 
requiring it of all Borel sets) we have what we will call a Radon measure. (Compare definitions 
25.2.2 and 25.4.4.) Then 25.4.3 holds for all locally compact Hausdorff spaces if "regular" is replaced 
by "Radon" in the statement of the theorem. 

A second approach is to require a little bit more of our topological space. If we adopt the attitude 
that all reasonably nice locally compact Hausdorff spaces (all Euclidean spaces, for example) are 
cr-compact (that is, a countable union of compact subspaces), and restrict our attention to those 
spaces, the theorem remains true. (Consult [ ], theorem 2.18.) 

A third approach (a disguised version of the first) is to change the definition of "regular" . Hewitt 
and Stromberg [23] (definition 12.39) use the word "regular" (as applied to positive measures) for 
the concept we here call "Radon". The price paid for this strategy is a very different looking 
definition for regularity of complex measures. (Compare [23] definitions 12.39 and 20.40.) For 
complex measure spaces these two definitions do indeed turn out to be equivalent, but it requires 
a fairly substantial theorem to establish this (see [23], theorem 12.40). 

25.4.4. Definition. A positive Borel measure on a locally compact Hausdorff space X is a 
Radon measure if 

(i) /u(-ftr) < oo for every compact subset K of X, 

(ii) n is outer regular, and 

(iii) fi is inner regular on every open subset of X. 

It is worthwhile to keep in mind how seldom the distinction between Radon measures and 
regular Borel measures is required. On spaces which are countable unions of compact sets, the 
concepts are identical. 

25.4.5. Proposition. Let X be a locally compact Hausdorff space which is a countable union of 
compact sets. Then every Radon measure on X is regular. 

Proof. See [18], proposition 7.5 and corollary 7.6. 
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25.4.6. Theorem (Riesz representation III). Let X be a locally compact H aus dor jj space. If (p is 
a positive linear functional on Cc{X), then there exists a unique positive Radon measure ^ on X 
such that 

m= [ fdfi 

Jx 

for allfeCciX). 

Proof. See [2], theorem 28.3; [11], theorem 7.2.8; [18], theorem 7.2; theorems 12.35 and 
12.36; and [37], theorem 2.14. 

25.4.7. Theorem (Riesz representation IV). Let X be a compact H aus dorff space. If (f) is a bounded 
linear functional on C{X), then there exists a unique real regular Borel measure ^ on X such that 

Hf)= [ fdfi 
Jx 

for allfeC{X) and = ||0||. 

Proof. See [ ], theorems 4.3.13 and 4.3.15; [ ], theorems 18.3 and 18.4; and [ ], theorem 
9.15. 

25.4.8. Corollary. If X is a compact Haus dor jJ space, then (C(X))* is isometrically isomorphic 
to M{X). 

25.4.9. Theorem (Riesz representation V). Let X be a locally compact Haus dorff space. If (p is a 
bounded linear functional on Co{X), then there exists a unique real regular Borel measure fi on X 
such that 

Jx 

for all f G Co{X) and \\fj.\\ = 

Proof. See [,..,:], theorem 7.17; [ : :], theorems 20.47 and 20.48; [ : ], theorem 9.16;and [37], 
theorem 6.19. 

25.4.10. Corollary. If X is a locally compact Hausdorff space, then (Co(^))* is isometrically 
isomorphic to M{X). 

25.4.11. Corollary. If X is a locally compact Hausdorff space, then M{X) is a Banach space. 

25.4.12. Proposition. The Radon map Q: M— >• C* defined in example 25.3.6 is a natural equiv- 
alence. 
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CONNECTED SPACES 

26.1. Connectedness and Path Connectedness 

26.1.1. Proposition. If X is a topological space, then the following conditions are equivalent: 

(a) X is the union of two disjoint nonempty open sets; 

(b) X has a nonempty proper subset which is both closed and open; 

(c) there is a subset of X whose characteristic function is continuous but not constant; and 

(d) the algebra C{X) of continuous real valued functions on X has a nontrivial idempotent. 

26.1.2. Definition. A topological space which satisfies any (hence all) of the conditions listed in 
the preceding proposition is disconnected. If U and V are nonempty disjoint open sets whose 
union is the topological space X, we say that U and V disconnect X. A space is connected if 
it is not disconnected. 

26.1.3. Example. Every discrete topological space with more than one point is disconnected. 

26.1.4. Example. Every indiscrete topological space is connected. 

26.1.5. Example. Every infinite set with the cofinite topology is connected. (Otherwise there 
would be a finite set with a finite complement.) 

26.1.6. Example. The space Q of rational numbers (under the topology it inherits from M) is 
disconnected. 

26.1.7. Example. The connected subsets of M are the intervals. 

The next proposition says that the continuous image of a connected space is connected. 

26.1.8. Proposition. If X and Y are topological spaces, X is connected, and f : X ^ Y is a 

continuous surjection, then Y is connected. 

26.1.9. Example. Let X be a connected topological space. Show that a function / : X — t- Q is 
continuous if and only if it is constant. 

26.1.10. Theorem (Intermediate value theorem). ///: M — )■ M is continuous, then the image 
under f of an interval is an interval. 

When is a subspace of a topological space connected? It is tempting — and hopelessly wrong — to 
conjecture that a subspace W of a topological space X is disconnected if and only if there exist 
disjoint open subsets U and V of X both of which intersect W and whose union contains W. 
Certainly the "if" part of this assertion is correct. The next example, however, shows that W may 
well be disconnected even when there do not exist such subsets U and V of X. This is the reason 
for proposition 26.1.12. It gives a criterion for a subspace of X to be disconnected which does not 
rely on the relative topology ofW. The criterion is in terms of the topology on X. 

26.1.11. Definition. Two subsets A and B of a topological space X are mutually separated 
if 

AnB = AnB = (/}. 
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26.1.12. Example. Let X = {a,6,c} and T = {0, {6}, {o, 6}, {6, c}, X}. Then 1 is a topology 
on X. There exists a subset W oi X which is disconnected but which cannot be disconnected by 

o 

two open subsets of X. (That is, there do not exist disjoint sets U ,V X both of which intersect 
W and whose union contains W .) 

26.1.13. Proposition. A subset C of a topological space X is disconnected if and only if it is the 
union of two nonempty mutually separated subsets of X . 

Proof. See [io], theorem 26.5. 

26.1.14. Proposition. If & is a family of connected subsets of a topological space X whose inter- 
section is nonempty, then {J& is connected. 

26.1.15. Corollary. If{Ck) is a sequence of connected subsets of a topological space and Cnf^Cn+i 
is nonempty for every n G N, then IJ^i is connected. 

26.1.16. Proposition. In a topological space the closure of a connected set is connected. In fact, 
ifCQA'^C'^X and C is connected, then so is A. 

26.1.17. Proposition. A nonempty product X = Y\xeA^>^ topological spaces is connected if 
and only if each X\ is connected. 

Proof. See [ LG], theorem V.1.7; [ ], theorem 6.6.3; and [ ], theorem 26.10. 

26.1.18. Definition. Let a be a point in a topological space X. We define Ca, the component 
(or CONNECTED COMPONENT) of X containing a to be the union of all connected subsets of X 
which contain a. Thus Ca is the largest connected subset of X containing a. 

26.1.19. Proposition. Every component of a topological space is a closed set. 

26.1.20. Proposition. Let X be a topological space and x, y £ X. If y £ Cx, then Cy = Cx and 
ifyi Cx, then CynCx = 0. 

26.1.21. Example. No two of the intervals (0, 1), (0, 1], and [0, 1] are homeomorphic. 

26.1.22. Example. A closed bounded interval [a,b] in M, a closed disk in the plane, and the 
boundary of a disk (a circle) are all connected compact Hausdorff spaces, but no two of them are 
homeomorphic . 

26.1.23. Proposition. Let E be a Lebesgue measurable subset o/M with Lebesgue measure one. 
For each t € M let Ef = {x £ E : x < t} . Then for every a £ (0, 1) there exists a number to € M 
such that X[Et^) = a. 

A concept closely related to (but stronger than) connectedness is arcwise connectedness. 

26.1.24. Definition. A topological space X is arcwise connected (or path connected) if 
for every x, y £ X there exists a continuous map /: [0, 1] X such that /(O) = x and /(I) = 
y. Such a function / is an arc (or path , or curve) connecting x to y. It is very easy to 
prove that arcwise connected spaces are connected (proposition 26.1.25). The converse is false 
(example 26.1.26). If, however, we restrict our attention to open subsets of M", then the converse 
does hold (proposition 26.1.27). 

26.1.25. Proposition. If a topological space is arcwise connected, then it is connected. 

26.1.26. Example. Let B = {{x,smx^^): < x < 1}. Then i? is a connected subset of but 
is not arcwise connected. 

Hint for proof . Let M = B. To show that M is not arcwise connected argue by contradiction. 
Assume there exists a continuous function /: [0, 1] — )• M such that /(I) £ B and /(O) ^ B. 
Prove that = vr2 o / is discontinuous at the point to = sup/^(M \B). To this end show that 
to £ f^{M \B). Then, given 6 > 0, choose ti in [0, 1] so that to < ti < to + S. Without loss of 
generality one may suppose that /2(to) < 0. Show that (/^)~'[to, ti] is an interval containing and 
/^(ti) (where = tti o /). Find a point t in [to, ti] such that < /^t) < f^ih) and /^(t) = 1. 
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26.1.27. Proposition. Every connected open subset o/M" is arcwise connected. 
Proof. See [xi], solution Q.17.7. 

26.2. Disconnectedness Comes in Many Flavors 

26.2.1. Definition. A topological space is totally disconnected if all its components are 
points. It is EXTREMALLY DISCONNECTED if the closure of every open set is open. It is zero- 
DIMENSIONAL if it has a base consisting of sets which are both closed and open. 

26.2.2. Example. Recall that the indiscrete topology on a set X has only two open sets, and X. 
Let a set (with at least two elements) have the indiscrete topology. Is the resulting space discrete? 
disconnected? extremally disconnected? totally disconnected? zero-dimensional? 

26.2.3. Example. Consider the three-point space of example 26.1.12. Is it Ti? Hausdorff? dis- 
crete? disconnected? extremally disconnected? totally disconnected? zero-dimensional? 

26.2.4. Example. Is the set Q of rational numbers discrete? disconnected? extremally discon- 
nected? totally disconnected? zero-dimensional? 

26.2.5. Example. Let V = {1/n: n e N}, W = {0}UV, and X = {*}UW (where =f= is any element 
not in W). Let a subset U belong to T if C/ is an open subset of W or if {U \ {*}) U {0} is an open 
subset of W. Check that T is a topology for X. Is the resulting topological space Ti? Hausdorff? 
discrete? disconnected? extremally disconnected? totally disconnected? zero-dimensional? 

26.2.6. Example. The set of intervals of the form [a, h) (where a < 6) is a base for a topology on 
the set M of real numbers. The resulting topological space is sometimes called the Sorgenfrey line. 
In this space find the closure of the sets 

{l/n:nGN} and {-l/n:nGN}. 

Is the Sorgenfrey line Til Hausdorff? discrete? disconnected? extremally disconnected? totally 
disconnected? zero-dimensional? 

26.2.7. Example. Let N have the cofinite topology. (The nonempty open sets are the ones 
with finite complement.) Is this space discrete? disconnected? extremally disconnected? totally 
disconnected? zero-dimensional? 

26.2.8. Exercise. In topological spaces what relationships (if any) exist between the properties 
discrete, extremally disconnected, totally disconnected, and zero-dimensional? 

26.2.9. Definition. A topological space X is separable if it has a countable dense subset. It is 
FIRST countable if each point in X has a countable neighborhood base. It is Lindelof if every 
open cover of X has a countable subcover. 

26.2.10. Example. The Sorgenfrey line (see example 26.2.6 is a separable first countable, Lindelof 
space which is not second countable. 

26.2.11. Exercise (The Moore plane). Consider H = {(a;,y) G M^: y > 0}. Put a topology T 
on H as follows. Basic neighborhoods of points (x, y) where y > will be the usual open disks 
(restricted to those that lie in ff, of course). A basic neighborhood of a point p on the x-axis 
will be a set of the form {p} U D where D is an open disk in H tangent to the x-axis at p. Show 
that T is a topology on H. Describe the operations of interior and closure in this space. Is this 
space separable? Ti? Hausdorff? connected? regular? normal? first countable? Lindelof? locally 
compact? Must a subspace of a separable topological space be separable? Hint. The question of 
normality of -fT is a little tricky. One approach is to suppose that for every irrational number p we 
are given an equilateral triangle Tp satisfying: 

(i) one vertex of Tp is the point (p, 0); and 

(ii) the side of Tp opposite this vertex lies above the x-axis and is parallel to it. 
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For every irrational p let hp be the height of the triangle Tp. For every natural number n let 
Dn = {p e Q," : hp > \}. Show that for some ra G N the closure of the set D„ (in the usual 
topology on M) contains some interval [a, h] (with a <h). Deduce from this that UpeQ'^ -^p contains 
a rectangular region of the form [a,h\ x (0, ^) (with n e N and a < b). What does this tell you 
about the possibility of separating Q x {0} and Q'^ x {0} by open sets? 
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MODES OF CONVERGENCE 

27.1. Functions on Positive Measure Spaces 

27.1.1. Definition. Let (5, /x) be a positive measure space, (/„) be a sequence of measurable real 
valued functions on S, and g: 5 ^ M be measurable. 

(1) (i) The sequence (/„) CONVERGES TO g ALMOST everywhere (written fn^g (fi-a.-e.) 
or just fn^ 9 (a.e.) ) if //({x G S : fn{x) ^ 9{x)}) = 0. 

(ii) The sequence (/„) CONVERGES TO g ALMOST UNIFORMLY (written fn^9 (a.u.) ) if for 

m 

every e > there exists ECS such that fn^g uniformly on E and nyE'^) < e. 

(iii) The sequence (/„) CONVERGES TO g IN MEASURE (written fn^g (meas) or fn—^g) if 
for every e > 

li{{xeS: |/„(x)-5(x)| >€}) ^0 

as n — >■ oo. 

(iv) If (/„) is a sequence of integrable functions and g is integrable, then the sequence (/„) 
CONVERGES TO g IN MEAN (written fn—^g (mean) ) if Jg\fn - g\ d/i ^ as n ^ oo. This 
is just the usual seminorm convergence in the space Ci{S,^). 

27.1.2. Proposition. Let (5*,//) be a positive measure space, (fn) be a sequence of measurable 
functions on S, and g and h be measurable functions on S. If fn g (iJ'-a.e.) and fn h 
(fj,-a.e.), then g = h fi-a.e. 

27.1.3. Proposition. Let S be a positive measure space. In the space of measurable functions on 
S almost uniform convergence implies almost everywhere convergence. 

27.1.4. Corollary. Let (S*, /i) be a positive measure space, ( fn) be a sequence of measurable func- 
tions on S, and g and h be measurable functions on S. If fn ^ g (a.u.) and fn~^h (a.u.), then 
g = h fi-a.e. 

27.1.5. Proposition. Let (S, //) be a positive measure space, {fn) be a sequence of measurable 

functions on S, and g and h be measurable functions on S. If fn—^g and fn ^ > h, then g = h 
jjL-a.e. 

27.1.6. Proposition. Let S be a positive measure space. In the space of measurable functions on 
S almost uniform convergence implies convergence in measure. 

27.1.7. Proposition. Let S be a positive measure space. In the space of measurable functions on 

S convergence in mean implies convergence in measure. 

27.1.8. Corollary. Let {S,fi) be a positive measure space, ( fn) be a sequence of measurable func- 
tions on S, and g and h be measurable functions on S. If fn ^ g (mean) and fn^h (mean), then 
g = h fi-a.e. 

27.1.9. Example. Define a sequence ( J„) of subsets of the unit interval [0, 1] as follows: Ji = [0, 1], 

J2 = [0,^], J3 = [5,1], J4 = J5 = Ihl], J& = [0,5], Jj = [0, jJjJs = [1,5], J'i = [\,\], 

Jio = [f,l], Jii = J12 = [|,|], ••• • For each n G N let /„ = Xj^- The functions (/„) 

are sometimes called the dancing functions. With respect to Lebesgue measure on M the sequence 
converges in mean and in measure, but not almost uniformly or almost everywhere. 
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27.1.10. Example. In the following diagram the three arrows indicate the implications proved 
in propositions 27.1.3, 27.1.6, and 27.1.7. Altogether there are twelve possible implications. Give 
examples to show that none of the nine remaining implications hold. 

a.u. a.e. 



mean *- meas 

27.1.11. Proposition. Let S be a positive measure space. Suppose that in the space of measurable 
functions on S a sequence (fn) converges in measure to a function g . Then there exists a subsequence 
{frik) which converges to g almost uniformly. 

Proof. See [ '], theorem 2.5.3. 

27.1.12. Corollary. Let S be a positive measure space. Suppose that in the space of measurable 
functions on S a sequence (fn) converges in measure to a function g. Then there exists a subsequence 
{frik) which converges to g almost everywhere. 

27.1.13. Corollary. Let S be a positive measure space. Suppose that in the space of measurable 
functions on S a sequence (/„,) converges in mean to a function g. Then there exists a subsequence 
{fn^) which converges to g almost uniformly. 

27.1.14. Corollary. Let S be a positive measure space. Suppose that in the space of measurable 
functions on S a sequence (fn) converges in mean to a function g. Then there exists a subsequence 
{fn,,) which converges to g almost everywhere. 

27.2. Functions on Finite Measure Spaces 

27.2.1. Proposition (Egoroff's theorem). Let S be a positive real measure space. In the space of 
measurable functions on S convergence almost everywhere implies almost uniform convergence. 

Proof. See [2], theorem 13.7; [11], proposition 3.1.3; [18], theorem 2.33; and [ 3], theorem 
11.32. 

27.2.2. Corollary. Let S be a positive real measure space. In the space of measurable functions 
on S convergence almost everywhere implies convergence in measure. 

27.2.3. Example. In the following diagram the five arrows indicate the implications proved in 
section 27.1, proposition 27.2.1, and corollary 27.2.2, all of which hold for finite positive measure 
spaces. Give examples to show that none of the seven remaining implications hold. 

a.u. > a.e. 




mean 5- meas 

An interesting consequence of Egoroff's theorem 27.2.1 is that measurable functions on sets of 
finite measure are, in some sense, almost continuous. This is Lusin's theorem. 

27. 2 A. Theorem (Lusin). Let ^ be a Radon measure on a locally compact Hausdorff space X 
and f be a measurable complex valued function whose support has finite measure. Then for every 
e > there exists a continuous function g: X ^ C with compact support which agrees with f except 
perhaps on a measurable set E such that ^{E) < e. Furthermore, if f is bounded, then g may be 
chosen so that \\g\\u < ll/IU- 
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Proof. See [18], Theorem 7.10. 

Lusin's theorem admits some rather far-reaching generahzations. For a samphng of these see [4], 
theorem 4.3.16 and corollaries 4.3.17; [I 1], theorem 7.4.3; [23], theorem 11.36; and [37], theorem 
2.24. 

27.2.5. Proposition. Let fi be a Radon measure on a locally compact Hausdorff space X. Then 
Cc{X) is dense in Lp(X, /x) for 1 <p < oo. 

27.3. Dominated Convergence 

27.3.1. Definition. We say that a sequence (/„) of real (or complex) valued functions on a set S 
is DOMINATED by a positive function /i if < h on S. 

The first result of this section is an easy consequence of the Lebesgue dominated convergence 
theorem. 

27.3.2. Proposition. Let (5, /u) be a measure space and (fn) be a sequence of measurable functions 
on S which is dominated by an L\ function h. If (fn) converges almost everywhere to a measurable 
function g, then g is integrable and (fn) converges in mean to g. 

27.3.3. Proposition. Let (5*, /u) be a measure space and (fn) be a sequence of measurable functions 
on S which is dominated by an L\ function h. If (/„,) converges almost everywhere to a measurable 
function g, then[fn) converges to g almost uniformly. 

27.3.4. Proposition. Let S be a measure space and ( fn) be a sequence of measurable functions on 
S which is dominated by an Ci function h. If (fn) converges in measure to a measurable function 
g, then it converges in mean to g. 

27.3.5. Corollary. Let S be a measure space and {fn) be a sequence of measurable functions on 
S which is dominated by an C\ function h. If (fn) converges almost uniformly to a measurable 
function g, then it converges in mean to g. 

27.3.6. Corollary. Let S be a measure space and {fn) be a sequence of measurable functions on 
S which is dominated by an Ci function h. If {fn) converges to a measurable function g almost 
everywhere, then it converges to g in measure. 

27.3.7. Example. In the following diagram the eight arrows indicate the implications proved in 
this section and in section 27.1. Give examples to show that none of the four remaining implications 
hold (even if the space has finite measure). 



a.u. > a.e. 




mean < - meas 

27.4. Convergence in Measure is Topological 

27.4.1. Proposition. Let V be a vector space and s: y — )• M 6e a function that satisfies: 

(i) s{x + y) < s{x) + s{y), 

(ii) s{—x) = s{x), and 

(iii) s(0) = 

for all X, y gV. Define p{x,y) = s{x — y) for x, y €zV. Then p is a pseudometric on V . 
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27.4.2. Proposition. Let (S, fi) be a measure space and M.{S) he the vector space of all measurable 
real valued functions on S. For each f G A^(5) and a > let E{f,a) = {x £ S: \ f{x)\ > a}. 
Define 

s: M{S) ^M+: inf {arctan(a + /u(^(/,a)))} . 

Thus fi, acting through the function s, induces a topology on Ai(S). 

Hint for proof . Show that E(f + g,a + (3) C E{f,a) U E(g,(3). Use the preceding proposi- 
tion 27.4.1. 

The following result shows that convergence in measure is a topological notion. 

27.4.3. Proposition. Let (5, /i) be a measure space, (fn) be a sequence in Ai{S), and g G A4{S). 
Then fn^gin the topology induced on Ai{S) by /x (see the preceding proposition 27.4-2) if and 
only if fn^ g (meas) . 

Hint for proof . Notation as in the two preceding propositions. For g, h €z M{S) and e, 77 > 
prove that 

(7) < minjarctane, arctanr/} =^ ^{^E(h — g,r])) < e. 
Prove the contrapositive. Assume that fi{E{h — g,ri)) > e. Show that for a > 

arctan(a + n{E{h — g,a))^ > minjarctane, arctanry}. 
(Look at the cases a < r] and a > r] separately.) 
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OPERATORS ON HILBERT SPACES 

28.1. Orthonormal Bases 

28.1.1. Definition. Let i7 be a Hilbert space and AC. H. For every F G Fin A define 

Then s = {sf) p^p^^j^^ is a net in H. If this net converges, the set A is said to be summable; the 
Umit of the net is the SUM of A and is denoted by J2 ^- Indexed sets require a shghtly different 
notation. Suppose, for example, that A = {xi : i E 1} where / is an arbitrary index set. Then for 
each F G Fin/ 

= ^ (= ^{xi : i G F}) . 
ieF 

As above s is a net in H. If it converges {xi : i G /} is summable and its sum is denoted by X^j^/ Xi 
(or by "^{xi : i G /}). An alternative way of saying that {xi : i G /} is summable is to say that the 

"series" J2iei ^« converges or that the "sum" J2iei ^« exists. 

28.1.2. Remark. One frequently finds some version of the following "proposition" in textbooks. 
If Yliei Xi = u and X^jg/ xi = v \n. a. Hilbert (or Banach) space, then u = v. 

(I once saw a 6 line proof of this.) Of which result is this a trivial consequence? 

28.1.3. Proposition. If {xi: i G 1} and {yi: i G /} are summable subsets of a Hilbert space and 
a is a scalar, then {axi : i E 1} and {xi + yi: i E 1} are summable; and 

^^^^ CXX'i — CX ^^^^ 

iei iei 

and 

^Xi + ^yi = ^{xi + yi) . 
iei iei iei 

28.1.4. Proposition. If {xi: i G 1} is a summable subset of a Hilbert space H and y is a vector 
in H, then {{xi,y) : i E 1} is a summable set of scalars and 

^{xi,y) = (y2xi,y 

iei ^ iei 

28.1.5. Proposition. Let H be a Hilbert space and M ^ H. For every x E H there exists a unique 
vector y E M such that x — y E M-*- . 

28.1.6. Definition. Let H he a Hilbert space and M ^ H. Define a function 

Pm : H ^ H: x^y 

where y is the vector whose existence and uniqueness was asserted in the preceding proposition. 

28.1.7. Proposition. Let H be a Hilbert space and M ^ H. Then the map Pm defined above is 
a (bounded linear) operator on H and \\Pm\\ < 1- 
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28.1.8. Proposition. Let H be a Hilhert space and M 4 H. Then Pm'^ = Pm, kerPAf = M^, 
and ran Pa/ = M . 

28.1.9. Definition. A nonempty subset of a Hilbert space is orthonormal if e _L / for every 
pair of distinct vectors e and f m E and ||e|| = 1 for every e G E. 

28.1.10. Proposition (Gram-Schmidt orthonormalization). If (v^) is a linearly independent se- 
quence in an inner product space V, then there exists an orthonormal sequence (e^') in V such that 
span{f ^, . . . , v"^} = span{e^, . . . , e^} for every n £ N. 

28.1.11. Proposition. In a Hilbert space H let P be the orthogonal projection onto the subspace 
M which is the closed linear span of an orthonormal set E of vectors in H . Then for every x £ H 
we have Px = ^{ (x, e) e : e G E} . 

28.1.12. Proposition (Bessel's inequality). Let H be a Hilbert space and E be an orthonormal 
subset of H . For every x £ H 

Y^\\{x,e)f<\\x\\\ 

28.1.13. Proposition. Let E be an orthonormal set in a Hilbert space H. Then the following are 
equivalent. 

(i) E is a maximal orthonormal set. 

(ii) If X J- E, then x = (E is totalj. 

(iii) \J E = H (E is a complete orthonormal set J. 

(iv) X = ^^(a;, e)e for all x £ H. (Fourier expansion) 

(v) {x,y) = ^^(x,e)(e, y) for all x,y £ H. (Parseval's identity.) 

(vi) ||x||^ = ^^|(x,e)|^ for all x £ H. (Parseval's identity.) 

28.1.14. Definition. If if is a Hilbert space, then an orthonormal set E satisfying any one (hence 
all) of the conditions in proposition 28.1.13 is called an orthonormal basis for H (or a complete 
ORTHONORMAL SET in H, or a HiLBERT SPACE BASIS for H). Notice that this is a very different 
thing from the usual (Hamel) basis for a vector space (see 5.3.12). 

28.1.15. Proposition. If E is an orthonormal subset of a Hilbert space H, then there exists an 
orthonormal basis for H which contains E. 

28.1.16. Example. For each n G N let e" be the sequence in I2 whose n^^ coordinate is 1 and all 
the other coordinates are 0. Then {e": n £ N} is an orthonormal basis for I2. This is the USUAL 
ORTHONORMAL BASIS for I2. 

28.1.17. Exercise. A function / : R — ;> C of the form 

n 

f{t) = ao + ^^(ofc cos kt + 6fc sin kt) 

k=l 

where oq, • . . , a„, 61, . . . , 6„ are complex numbers is a trigonometric polynomial. 

(a) Explain why complex valued 27r-periodic functions on the real line M are often identified 
with complex valued functions on the unit circle T. 

(b) Some authors say that a trigonometric polynomial is a function of the form 

n 

fit) = cue'^' 

k=—n 

where ci, . . . , c„ are complex numbers. Explain carefully why this is exactly the same as 
the preceding definition. 
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(c) Justify the use of the term trigonometric polynomial 

(d) Prove that every continuous 27r-periodic function on M is the uniform hmit of a sequence 
of trigonometric polynomials. 

28.1.18. Proposition. Consider the Hilhert space L2([0, 27r], C) with inner product 



28.1.19. Exercise. Calculate > —k. Hint. Let /(x) = x for < x < 2it. Regard / as a member 



of L2([0,27r],C). Find ||/|| in two ways. 

28.2. Adjoints of Hilbert Space Operators 

28.2.1. Proposition. Let S, T £ ^{H, K) where H and K are Hilhert spaces. If {Sx, y) = {Tx, y) 
for every x £ H and y £ K , then S = T. 

28.2.2. Proposition. If H is a complex Hilbert space and T € satisfies {Tz,z) = for all 
z £ H , then T = 0. The corresponding result fails in real Hilbert spaces. 

Hint for proof . In the hypothesis replace z first hy x + y and then by x + iy. 

Despite our convention made at the beginning of section 20.6 that in these notes all Hilbert 
spaces are complex, the word "complex" was added to the hypotheses of the preceding proposition 
to draw attention to the fact that it is one of the few facts which holds only for complex spaces. 
While proposition 28.2.1 holds for both real and complex Hilbert spaces 28.2.2 does not. (Consider 
the operator which rotates the plane M? by 90 degrees.) 

28.2.3. Definition. A function T : V ^ W between complex vector spaces is conjugate linear 
if it is additive {T(x + y) = Tx + Ty) and satisfies T{ax) = aTx for all x £ V and a £ C A 
complex valued function of two variables (j): V x W ^ C is a, sesquilinear functional if it is 
linear in its first variable and conjugate linear in its second variable. 

If H and K are normed linear spaces, a sesquilinear functional (p on H x K is bounded if there 
exists a constant M > such that 



for all x £ H and y £ K. 

28.2.4. Proposition. If (p: H ® K ^ C is a hounded sesquilinear functional on the direct sum of 
two Hilbert spaces, then the following numbers (exist and) are equal: 

• sup{|(/>(x,y)| : ||x|| < 1, ||y|| < 1} 

• sup{|0(x,y)| : ||x|| = ||y|| = 1} 



• inf{Af > 0: \4){x,y)\ < M\\x\\ \\y\\ for all x, y £ H}. 

Hint for proof . The proof is virtually identical to the corresponding result for linear maps 
(see 12.2.6). 

28.2.5. Definition. Let (p: H (B K ^ C he a bounded sesquilinear functional on the direct sum 
of two Hilbert spaces. We define ||0||, the norm of cp, to be any of the (equal) expressions in the 
preceding result. 




For every integer n (positive, negative, or zero) let e"'(x) = e*"^ for all x £ [0, 2tt]. Then the set of 
these functions e" for n £ Z is an orthonormal basis for L2{[0, 27r], C). 




k=l 



\^{x,y)\ < M||x||||y 
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28.2.6. Proposition. Let T: H ^ K be a bounded linear map between Hilbert spaces. Then 
(j): H (B K ^ C: (x, y) i— )• {Tx, y) is a bounded sesquilinear functional on H ® K and = \\T\\ . 

28.2.7. Proposition. Let (p: H x K ^ C be a bounded sesquilinear functional on the product of 
two Hilbert spaces. Then there exist unique bounded linear maps T G ^{H,K) and S G ^{K,H) 
such that 

Hx,y) = {Tx,y) = {x,Sy) 
for allxeH and y e K. Also, \\T\\ = \\S\\ = ||0||. 

Hint for proof . Show that for every x & H the map y i— )• 4>{x, y) is a bounded hnear functional 
on K. 

28.2.8. Definition. Let T: H ^ K he a, bounded hnear map between Hilbert spaces. The 
mapping (x, y) i— )■ (Tx, y) from H®K into C is a bounded sesquilinear functional. By the preceding 
proposition there exists an unique bounded linear map T* : K ^ H called the adjoint of T such 
that 

{Tx,y) = {x,T*y) 
for allx e H and y £ K. Also, ||r|| = ||r*||. 

28.2.9. Example. Recall from example 20.6.4 that the family I2 of all square summable sequences 
of complex numbers is a Hilbert space. Let 

S: h- (xi, X2, X3, . . . ) i-> (0, xi, X2, . . . ) . 

Then S is an operator on /2, called the unilateral shift operator, and ||5|| = 1. The adjoint 
S* of the unilateral shift is given by 

S*: h ^ h- {xi,X2, X3, . . . ) I-)- (x2, X3, X4, . . . ). 

In 16.2.15 we defined multiplication operators on C{X) where X is a topological space. Multipli- 
cation operators can be defined on many spaces; there is nothing special in this respect about C{X). 
For example, we can just as well consider multiplication operators on spaces of square integrable 
functions. 

28.2.10. Example. Let {S,fi) be a sigma-finite measure space and L2{^) be the Hilbert space of 
all (equivalence classes of) complex valued functions on S which are square integrable with respect 
to //. Let (j) be an essentially bounded complex valued ^-measurable function on S. Define 

on L2{n) by M^{f) := (j)f. Then is an operator on ^2(6'); it is called a MULTIPLICATION 
OPERATOR. Its norm is given by HM^H = ||(/'||oo and its adjoint by M^* = M-^. 

28.2.11. Example. Let {S,fj,) be a sigma-finite measure space and L2{^) be the Hilbert space of 
all (equivalence classes of) complex valued functions on S which are square integrable with respect 
to //. li k: S X S ^ C \s square integrable with respect to the product measure // x /i on 5 x 5, 
define K on L2{fJ.) by 

(K/)(x)= / k{x,y)f{y)dy. 
Jo 

Then K is an INTEGRAL operator and its kernel is the function k. The adjoint K* of K is also 
an integral operator and its kernel is the function k* defined on x by k*(x,y) = k{y,x). 

28.2.12. Exercise. Let V be the integral operator on L2{ [0, 1] ) whose kernel is the characteristic 
function of {(x, y) G [0, 1] x [0, 1] : y < x}. 

(a) Find a simple expression for Vf(x). 

(b) Show that V is an operator on H. 

(c) Show that V is injective. 

(d) Compute V* . 

(e) What is V + V*7 What is its range? 
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28.2.13. Proposition. If S and T are operators on a Hilbert space H and a G C, then 
(i) (S + T)* = S* +T*; 



(ii) {aTf =aT*; 

(iii) r** = T; and 

(iv) {TSy = S*T*. 



Notice that, despite the fact that every Hilbert space "is" a Banach space, the adjoint of a 
Hilbert space operator is defined differently from the adjoint of a Banach space operator (see 15.4.6). 
For an operator T between Banach spaces (ii) above becomes (aT)* = aT* . 

28.2.14. Proposition. Let T be an operator on a Hilbert space H . Then 



28.2.15. Proposition. If T is an operator on a Hilbert space, then 

(i) kerr* = (ranr)-^, 

(ii) ranT* = (kerT)-*-, 

(iii) kerT = (ranT*)-*-, and 

(iv) ranT = (kerr*)-^. 

28.2.16. Proposition. An operator T on a Hilbert space H is self-adjoint if and only if {Tx,x) G M 
for every x €z H . 

28.2.17. Proposition. The pair of maps H ^ H and T ^ T* taking every Hilbert space to itself 
and every bounded linear map between Hilbert spaces to its adjoint is a contravariant functor from 
the category HILqo of Hilbert spaces and bounded linear maps to itself. 

28.2.18. Example. Define an operator V on the Hilbert space H = L2([0, oo)) by 



Then V is an isometry but not an isomorphism in the category HILi of Hilbert spaces and con- 
tractive linear maps. What changes if H = L2(M)? 
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OPERATORS ON BANACH SPACES 

29.1. The Hahn-Banach Theorems 

29.1.1. Definition. A real valued function / on a real vector space y is a sublinear functional 
if f{x + y) < f{x) + f{y) and f{ax) = af{x) for all x, y £ V and a > 0. 

29.1.2. Example. A norm on a real vector space is a sublinear functional. 

29.1.3. Theorem (Hahn-Banach theorem I). Let M be a subspace of a real vector space V and p 
be a sublinear functional on V. If f is a linear functional on M such that f < p on AI , then f has 
an extension f to all of V such that f <p onV . 

Proof. See [i.^], theorem HI. 6. 2 and lemma HI. 6. 9; [23], theorem 14.9; and [32], theorem 
10.1. 

29.1.4. Theorem (Hahn-Banach theorem II). Let M be a vector subspace of a (real or complex) 
normed linear space V. Then every continuous linear functional f on M has an extension f to all 
ofV such that WfW = 11/11 . 

Proof. See [23], corollary 14.11 and theorem 14.12; [32], theorem 10.2 and corollary 10.1; 
and [37], theorem 5.16 and proposition 5.17. 

29.1.5. Exercise. Your buddy Fred R. Dimm thinks he has discovered a really simple direct 
proof of the version of the Hahn-Banach theorem given in proposition 29.1.4: A continuous linear 
functional / defined on a (linear) subspace M of a normed linear space V can be extended to a 
continuous linear functional / on all of V without increasing its norm. He says that all one needs 
to do is let be any vector space complement of M — that is, a linear subspace N such that, as 
vector spaces, V = M (B N — and let -B be a (Hamel) basis for N. Define /(e) = for every vector 
e G B and / = / on M. Then extend / by linearity to all of V. Surely, Fred says, / is linear by 
construction and its norm has not increased since we have made it zero where it was not previously 
defined. Show Fred the error of his ways. 

29.1.6. Theorem (Hahn-Banach theorem HI). Let V be a (real or complex) normed linear space, 
M ^ V, and x £ V. Then there exists a functional f £ V* such that f^(M) = {0}, f(x) = 
dix,M), and \\f\\ < 1. 

Proof. See [23], corollary 14.13; and [: , ], theorem 10.3. 

The most useful special case of the preceding is when M = {0} and x ^ 0. This tells us that 
the only way for f{x) to vanish for every f £ V* is for x to be the zero vector. 

29.1.7. Corollary. Let V be a normed linear space and x G V. If f(x) = for every f G V* , then 
x = 0. 

29.1.8. Corollary. If V is a normed linear space, then V* separates points ofV. 

29.1.9. Corollary. Let V be a normed linear space and x G V. Then there exists f £ V* such 
that 11/11 = 1 and f{x) = \\x\\. 
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29.1.10. Proposition. Let V be a normed linear space, {xi, . . . ,Xn} be a linearly independent 
subset of V , and ai,...,a„ be scalars. Then there exists an f in V* such that f{xk) = Ofc for 
l<k<n. 

Hint for proof. For x = Y2=i Pk^k let g{x) = Y.^^^ auPk- 

29.1.11. Proposition. Let V be a normed linear space and ^ x £ V . Then 

\\x\\ = max{|/(x)| : f eV* and \\f\\ < 1} . 

The use of max instead of sup in the preceding proposition is intentional. There exists an / in 
the closed unit ball of V* such that f{x) = \\x\\. 

29.1.12. Proposition. If B is a Banach space the map tb'- B ^ B** : x i— )• x** (defined in 
example 25.3.3) is an isometric linear map. 

29.1.13. Definition. A Banach space B is reflexive if the map tb is onto. 

Example 25.3.3 shows that in the category of Banach spaces and bounded linear maps, the 
mapping r: S i— >■ is a natural transformation from the identity functor to the second dual 
functor. With the Hahn- Banach theorem we can say more. 

29.1.14. Proposition. In the category of reflexive Banach spaces and bounded linear maps the 
mapping t: B ^ tb is a natural equivalence between the identity and second dual functors. 

29.1.15. Example. Every Hilbert space is reflexive. 

29.1.16. Exercise. Let V and W be normed linear spaces, ?7 be a nonempty convex subset of V, 
and f : U ^ W. Without using any form of the mean value theorem show that if d/ = on U, 
then / is constant on U. 

29.1.17. Example. Let be the Banach space of all bounded sequences of real numbers and c 
be the subspace of /qo comprising all sequences x = such that lim„_j.oo Xn exists. Define 

/ : c — )• M : X I— )• lim Xn- 
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Then / is a continuous linear functional on c and ||/|| = 1. By the Hahn-Banach theorem f 
has an extension to / G /qo* with ||/|| = 1. For every subset ^ of N define a sequence (x^) by 

^" ^ { o' if n ^ ^' ^^"^^ define ^x: *P(N) M: A i-^ ?{xn)- Then ^ is a finitely additive set 
function but is not countably additive. 

29.2. Banach Space Duality 

29.2.1. Notation. Let S be a normed linear space, M C B, and F Q B* . Then we define 

M-L := {f £ B*: f{x) = for all x £ M} 
:= {x e B: f{x) = for ah / G F} 

The set M-*- is the annihilator of M and Fj_ is the pre-annihilator of F. 

29.2.2. Proposition ( Annihilators) . Let B be a Banach space, M C B, and F C B* . Then 

(a) IfMONC B, then C . 

(b) IfF<ZG<^ B*, then G± C 

(c) M"*" is a closed linear subspace of B* . 

(d) F± is a closed linear subspace of B. 

(e) M C M-L_L. 

(f) F C F±^. 

(g) M^^=\/M. 

(h) = B* if and only if M = {0}. 



29.3. THE BAIRE CATEGORY THEOREM 



215 



(i) F± = B if and only if F = {0}. 

(j) If M is a linear subspace of B, then M-*- = {0} if and only if M is dense in B. 
(k) If B is reflexive (that is, if the natural embedding t: B ^ B** is an isomorphism), then 
= V-^- Hint. Show that t{F±) = F-^ and that therefore F±-^ = F-^±. 

29.2.3. Proposition. Let T G 5S(-B,C) where B and C are Banach spaces. Then 

(a) kerT* = (ranr)-^. 

(b) kerT = (ran r*)x. 

(c) ranT = (kerT*)_L. 

(d) ranT* C (kerT)-*". 

29.2.4. Example. The closed unit ball in I2 is not compact. 

29.2.5. Definition. Let i? be a Banach space. The weak topology on B is the weak topology 
induced by the elements of B* . (So, of course, the weak topology on the dual space B* is the weak 
topology induced by the elements of B** .) When a net {x\) converges weakly to a vector o in S 

we write x\ — a. The t(;*-TOPOLOGY (pronounced weak star topology) on the dual space B* is 
the weak topology induced by the elements of ranr where r is the natural injection of B into B** . 

When a net (/a) converges weakly to a vector g in B* we write f\ — ^^^-^ g. Notice that when B is 
reflexive (in particular, for Hilbert spaces) the weak and weak star topologies on B* are identical. 

29.2.6. Proposition. Let f & B* where B is a Banach space. For every e > and every finite 
subset F Q B define 

U{f; F; e) = {g€B*: \ f{x) - g{x)\ < e for all x € F}. 
The family of all such sets is a base for the w* -topology on B* . 

29.2.7. Theorem (Alaoglu's theorem). The closed unit ball of a Banach space is compact in the 
vu* -topology. 

Proof. See [8], theorem 15.11; ["^], theorem V.3.1; and [" '], theorem 1.23. 

29.3. The Baire Category Theorem 

As a matter of logical organization this section should probably appear in chapter 20 because 
of its close connection with complete metric spaces. But because the material is a bit complicated 
and several new concepts are introduced, I thought it better to postpone introducing the Baire 
category theorem until it is actually needed, which it is in the next section for the proof of the open 
mapping theorem. 

The word "category" in the title Baire category theorem has nothing to do with the modern 
algebraic sense of the word that we introduced earlier (in chapter 8). In the older usage of the 
word there are two types of topological spaces: those of first category that can be written as a 
countable union of nowhere dense sets, and everything else, known as sets of second category. This 
language is neither attractive nor helpful, but it is classical. Quite a few authors (Semadeni [41], 
for example), apparently disliking these unsuggestive terms, have replaced first category and second 
category with meager and non-meager, respectively. Central to the discussion is the topological 
notion of nowhere dense. 

29.3.1. Definition. A subset of a topological space is nowhere dense (or rare) if its closure 
has empty interior. 

29.3.2. Example. The set Z of integers is nowhere dense in the real line M. The set Q of rational 
numbers fails to be nowhere dense in M. 
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Notice that being nowhere dense is not an intrinsic property of a set. (Recah the discussion of 
open and closed in section 11.5.) A set (with a fixed topology) may be nowhere dense in one space 
and fail to be nowhere dense in another. If there is only one topological space under discussion, it 
is unambiguous to say A is a nowhere dense set. If, however, there is more than one space that 
contains A, be explicit and say A is nowhere dense in X, if that is what you intend. 

29.3.3. Example. Let X be the x-axis in and A = {(r, 0): r G Q}. Then A is nowhere dense 
in but fails to be nowhere dense in the subspace X. 

Despite the preceding example we do have the following useful positive result. 

29.3.4. Proposition. Let Y be a subspace of a topological space X and A (^Y. If A is nowhere 
dense in Y , then it is nowhere dense in X. Furthermore, if, additionally, Y is an open subset of 
X, then A is nowhere dense in Y if and only if it is nowhere dense in X . 

29.3.5. Example. The set {/ G C([0, 1]): \ f{x) - f{y)\ < |x - y| for ah x, y G [0, 1]} is nowhere 
dense in C([0, 1]). 

Here are some elementary observations. 

29.3.6. Proposition. The following hold in any topological space. 

(a) A set is nowhere dense if and only if its closure is. 

(b) Every subset of a nowhere dense set is nowhere dense. 

(c) The union of a finite family of nowhere dense sets is nowhere dense. 

The key to understanding many arguments related to the Baire category theorem is a realization 
that in general topological spaces the closed nowhere dense sets are the complements of open dense 
sets. 

29.3.7. Proposition. A subset of a topological space is open and dense if and only if its complement 
is closed and nowhere dense. 

29.3.8. Corollary. A topological space is a countable union of nowhere dense sets if and only if 
there exists a countable family of open dense sets in the space whose intersection is empty. 

29.3.9. Definition. A subset of a topological space is of first category (or is meager) in the 
space if it is a countable union of nowhere dense sets. It is of second category (or is nonmeager) 
if it is not of first category. A subset is residual if its complement is of first category. 

29.3.10. Example. The set of rational numbers, while dense in the reals, is a countable union of 
nowhere dense sets, so is of first category in M. 

With this new language we may rephrase corollary 29.3.8. 

29.3.11. Corollary. A topological space X is of second category (in itself) if and only if every 
countable family of dense open sets in X has nonempty intersection. 

Again, a reminder. Neither of these concepts is intrinsic. If, in some particular context, a set 
A is contained in two topological spaces, say X and y, it is necessary to be explicit and specify 
that A is, say, of first category in X or of first category in Y . 

29.3.12. Proposition. Let Y be a subspace of a topological space X and A^Y . If A is of first 
category in Y , then it is of first category in X. In particular, any subspace of X which is of first 
category in itself is also of first category in X. 

29.3.13. Example. It is easy to see that the preceding result does not hold for second category 
sets. The set N is of second category in itself, but not in M. 

Here is a fundamental fact. 

29.3.14. Proposition. If X is a nonempty topological space, then the following are equivalent: 
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(a) Every residual subset of X is dense. 

(b) Every nonempty open subset of X is of second category. 

(c) Every first category subset of X has empty interior. 

(d) The intersection of every countable family of open dense subsets of X is dense in X . 

29.3.15. Definition. A topological space is a Baire space if the intersection of every countable 
family of dense open subsets is dense. Obviously, had we so chosen, we could have used (a), (b), 
or (c) of the preceding proposition 29.3.14 instead of (d) for this definition. 

First we observe that the property of being a Baire space is G-hereditary. 

29.3.16. Exercise. Every nonempty open subset of a Baire space is itself a Baire space. 

29.3.17. Theorem (Baire category theorem I). Every complete metric space is a Baire space. 

Proof. Let M be a complete metric space and (?7„) be a sequence of dense open sets. What 
we want to show is that if 1/ is a nonempty open set in M, then the set V D fln^i ^« nonempty. 
We will generate a decreasing sequence of closed sets Bn, each contained in with nonempty 
intersection. 

Since V is nonempty and open and Ui is open and dense, we may choose an open ball Bi 
whose closure is contained in F n J7i and whose diameter is strictly less than 1. Now, since Bi is 
nonempty and open and U2 is open and dense, we may choose an open ball B2 whose closure is 
contained in V nU2 and whose diameter is strictly less than ^ . Proceed in this fashion to produce 
a decreasing sequence of closed sets B^ each with diameter strictly less than 1^ and contained in 
both V and By proposition 20.1.9 fl^i is nonempty. This shows that V n (fl^i ^n) is 
nonempty, as desired. □ 

Recall that by definition a Baire space is one in which the intersection of a sequence of dense 
open sets is dense and by corollary 29.3.11 a second category space is one in which the intersection 
of a sequence of dense open sets is nonempty. So, clearly, every nonempty Baire space is of second 
category in itself. The converse need not be true, however, even in metric spaces. 

29.3.18. Example. The space X = Q U (0, 1) (with the metric topology it inherits from R) is of 
second category (in itself) but is not a Baire space. 

Hint for proof . Let ri, r2, ... be an enumeration of the rational numbers which do not lie in 
(0, 1). Consider the sets ^ = (Q \ {r„}) U (0, 1). 

Making use of proposition 29.3.14 we can give another statement of the theorem. 

29.3.19. Theorem (Baire category theorem II). In a complete metric space every nonempty open 
set is of second category. 

One of the simplest and most common statements of the theorem is entirely adequate for most 
purposes. 

29.3.20. Theorem (Baire category theorem III). Every nonempty complete metric space is of 
second category. 

Clearly, propositions 29.3.7 and 29.3.14 make it possible to state Baire's theorem in many 
different guises. Here is one more. 

29.3.21. Theorem (Baire category theorem IV). If a nonempty complete metric space is the union 
of a sequence of closed sets, then one of these sets has nonempty interior. 

29.3.22. Example. The set Q of rational numbers is not a Gs subset of R. 
Hint for proof . Consider sets of the form R \ {r} where r is rational. 
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29.3.23. Example. Let = {/ G C([0, 1]) : /(r) is irrational for every rational number r in [0, 1]}. 
Then T is of second category in C([0, 1]). 

Hint for proof . For each pair (r, s) of rational numbers with r G [0, 1] and s arbitrary, consider 
the set Tr^s of all / G C([0, 1]) such that /(r) = s. 

29.3.24. Example. There exists a subset of the unit interval which is of first category but whose 
Lebesgue measure is 1. 

Hint for proof . Can you find a closed nowhere dense subset of [0, 1] whose measure is 1 — 1/n? 
Try Cantor-like sets. 

29.3.25. Proposition. Every proper finite dimensional subspace of a normed linear space is closed 
and nowhere dense. 

29.3.26. Proposition. A Hamel basis for an infinite dimensional Banach space is uncountable. 
Hint for proof . Use the preceding proposition 29.3.25. 

29.3.27. Exercise. Let I be the set of all sequences of real numbers which are eventually zero. 
Give / the usual pointwise vector space operations and the uniform norm. What is the cardinality 
of a Hamel basis for 1? Explain why your answer doesn't contradict the result in the preceding 
proposition 29.3.26. 

29.4. The Open Mapping Theorem 

29.4.1. Definition. A mapping f : X ^ Y between topological spaces is open if it takes open 
sets to open sets; that is, if f^{U) is open in Y whenever U is open in X. 

29.4.2. Theorem (Open mapping theorem). Every bounded linear surjection between Banach 
spaces is an open mapping. 

29.4.3. Example. Let Cu be the set of continuous real valued functions on [0, 1] under the uniform 
norm and Ci be the same set of functions under the Li norm ( ||/||i = /g |/| ). Then the identity 
map /: — 7- Ci is a continuous linear bijection but is not open. (Why does this not contradict the 
open mapping theorem?) 

29.4.4. Corollary. Every bounded linear bijection between Banach spaces is an isomorphism. 

29.4.5. Corollary. Every bounded linear surjection between Banach spaces is a quotient map 
in BAN. 

29.4.6. Example. Let / be the set of sequences of real numbers that are eventually zero, V be / 
equipped with the h norm, and W be / with the uniform norm. The identity map /: ^ — t- is a 
bounded linear bijection but is not an isomorphism. 

29.4.7. Example. The mapping I— 7- X shows that although bounded linear 
surjections between Banach spaces map open sets to open sets they need not map closed sets to 
closed sets. 

29.4.8. Proposition. Let V be a vector space and suppose that ||-||i and ||-||2 are norms on V 
whose corresponding topologies are Ti and %2- IfV is complete with respect to both norms and if 
Ti 5 12, then Ti = 12- 

29.4.9. Exercise. Does there exist a sequence (a„) of complex numbers satisfying the following 
condition: a sequence (x„) in C is absolutely summable if and only if (a„3;„) is bounded? Hint. 
Consider T : loo ^ h- (xn) ^ {xn/on). 
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29.4.10. Example. Let C^([a,6]) be the family of all twice continuously differentiable real valued 
functions on the interval [a,b]. Regard it as a vector space in the usual fashion. For each / E 
C2([a,6]) define 

11/11 = ||/||n + ||/'|U + ||/"L. 

This is in fact a norm and under this norm C^([a, 6]) is a Banach space. 

29.4.11. Example. Let C^([a, b]) be the Banach space given in exercise 29.4.10, and po and pi be 

members of C([a, 6]). Define 

T: C\[a,b])^C{[a,b]): f^f+pif+pof. 
Then T is a bounded linear map. 

29.4.12. Exercise. Consider a differential equation of the form 

y" + piy' +poy = q (*) 

where po, pi, and q are (fixed) continuous real valued functions on the interval [a, b]. Make precise, 
and prove, the assertion that the solutions to (*) depend continuously on the initial values. You 
may use without proof a standard theorem: For every point c in [a, b] and every pair of real numbers 
oq and ai, there exists a unique solution of (*) such that y(c) = and y'(c) = oi. Hint. For a 
fixed c G [a, b] consider the map 

S: C\[a,b])^C{[a,b]) x : / ^ (T/, /(c), /'(c)) 

where C^([a, b]) is the Banach space of example 29.4.10 and T is the bounded linear map of exam- 
ple 29.4.11. 

29.4.13. Definition. An bounded linear map T: V ^ W between normed linear spaces is 
BOUNDED AWAY FROM ZERO (or BOUNDED BELOw) if there exists a number 5 > such that 
\\Tx\\ > S\\x\\ for all X G V. (Recall that the word "bounded" means one thing when modifying 
a linear map and and something quite different when applied to a general function; so does the 
expression "bounded away from zero". See definition 22.3.4.) 

29.4.14. Proposition. A bounded linear map between Banach spaces is bounded away from zero 
if and only if it has closed range and zero kernel. 

29.4.15. Theorem (Fundamental quotient theorem for BALG). Let A and B be Banach algebras 
and I be a proper closed ideal in A. If cf) is a homomorphism from A to B and ker(/) D I, then there 
exists a unique homomorphism <p: A/I ^ B which makes the following diagram commute. 

A 

IT 

A/I ^B 

Furthermore, (p is injective if and only if ker (p = I; and (f) is surjective if and only if (f) is. 

29.5. The Closed Graph Theorem 

29.5.1. Theorem (Closed graph theorem). Let T: B ^ C be a linear map between Banach spaces. 
If the graph of T is closed in B ® C , then T is bounded. 

Hint for proof. Let G = {{x,Tx): x £ B} he the graph of T. Apply (corollary 29.4.4 of) 
the open mapping theorem to the map 

t:: G ^ B: {x,Tx)^ X . 
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29.5.2. Proposition. Let T: B ^ C he a linear mapping between Banach spaces. Then T is 
continuous if and only if the following condition is satisfied: 

if ^ in B and Txn c in C , then c = 0. 

29.5.3. Proposition. Suppose T: B ^ C is a linear map between Banach spaces such that if 
Xn in B then {g o T){xn) — )■ for every g G C* . Then T is bounded. 

29.5.4. Proposition. Let T : B ^ C be a linear function between Banach spaces. Define T*f = 
f oT for every f e C* . If T* maps C* into B* , then T is continuous. 

29.5.5. Definition. A bounded linear map from a Banach space into itself is a PROJECTION if it 
is idempotent. If P is such a map, we say that it is a projection ALONG kerP ONTO ran P. 

29.5.6. Proposition. An idempotent linear map from a Banach space into itself is a projection if 

and only if it has closed kernel and range. 

29.5.7. Proposition. Let H be a Hilbert space and S and T be functions from H into itself such 
that 

{Sx,y) = {x,Ty) 

for all X, y E H. Then S and T are hounded linear operators. (Recall that in this case the operator 
T is the (Hilbert space) adjoint of S and T = S* .) 

29.5.8. Definition. Let a < 6 in M. A function /: [a, 6] — t- M is continuously differentiablc if it is 
differentiable on (an open set containing) [a,h] and its derivative /' is continuous on [a, h]. The set 
of all continuously differentiable real valued functions on [a, h] is denoted by C^( [a, 6] ). 

29.5.9. Example. Let D : Ci( [0, 1] ) ^ C( [0, 1] ) : / ^ / ' where both [0, 1] ) and C{ [0, 1] ) 
are equipped with the uniform norm. The mapping D is linear and has closed graph but is not 
continuous. (Why does this not contradict the closed graph theorem?) 

29.6. Projections and Complemented Subspaces 

29.6.1. Definition. Let M be a closed linear subspace of a Banach space B. If there exists a 

closed linear subspace N of B such that B = M (B N, then we say that M is a COMPLEMENTED 
subspace, that A'^ is its (Banach space) complement, and that the subspaces M and N are 

COMPLEMENTARY. 

29.6.2. Proposition. Let M and N be complementary subspaces of a Banach space B. Clearly, 
each element in B can be written uniquely in the form m + n for some m E M and n E N. Define 
a mapping P : B B by P{m + n) = m for m E M and n E N. Then P is the projection along 
N onto M. 

29.6.3. Proposition. If P is a projection on a Banach space B, then its kernel and range are 

complementary subspaces of B. 

29.6.4. Proposition. Let B he a Banach space. If P: B ^ B is a projection onto a subspace M 
along a subspace N , then I — P is a projection onto N along M . 

29.6.5. Proposition. If M and N are complementary subspaces of a Banach space B, then B/M 
and N are isomorphic. 

29.6.6. Proposition. A bounded linear map T: B C between Banach spaces has a left inverse 
if and only if T is injective and ran T is complemented. 

29.6.7. Proposition. Let T: B C and S: C B be bounded linear maps between Banach 
spaces. If ST = I, then 

(a) S is surjective; 

(b) T is injective; 
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(c) TS is a continuous projection along ker S onto ran T; and 

(d) C = ran re ker 5. 

29.6.8. Exercise. Let S: C — )• i? be a bounded linear map between Banach spaces. Find a 
necessary and sufficient condition for S to have a right inverse. 

29.7. The Principle of Uniform Boundedness 

29.7.1. Definition. Let S* be a set and y be a normed linear space. A family of functions 
from S into V is POINTWISE bounded if for every x € S there exists a constant > such that 
||/(x)|| <M, for every /G^. 

29.7.2. Definition. Let V and W be normed linear spaces. A family T of bounded linear maps 
from V into W is UNIFORMLY BOUNDED if there exists M > such that ||T|| < M for every T £ T. 

29.7.3. Proposition. Let V and W be normed linear spaces. If a family T of bounded linear maps 
from V into W is uniformly bounded, then it is pointwise bounded. 

The principle of uniform boundedness is the assertion that the converse of 29.7.3 holds whenever 
V is complete. To prove this we will make use of a "local" analog of this for continuous functions 
on a complete metric space. 

29.7.4. Proposition. Let M be a complete metric space and T C C(M, M). If for every x £ AI 
there exists a constant > such that \f{x)\ < Nx for every f £ F , then there exists a nonempty 
open set U in M and a number N > such that < N for every f £ J- and every u £ U. 

Hint for proof . For each natural number k let Aj^ = n{/^( ^] ) : / £ F}. Conclude from 
the Baire category theorem 29.3.21 that at least one has nonempty interior. Let U = An" . 

29.7.5. Theorem (Principle of uniform boundedness). Let B be a Banach space and W be a 
normed linear space. If a family T of bounded linear maps from B into W is pointwise bounded, 
then it is uniformly bounded. 

Hint for proof . For every T £% define 

f^: B ^M.: x^ \\Tx\\. 

Use the preceding proposition to show that there exist a nonempty open subset U of B and a 
constant M > such that frp{x) < M for every T S T and every x £ U. Choose a point a £ B 

and a number r > so that Br{a) C U. Then verify that 

\\Ty\\<r-\f^{a + ry)+f^{a))<2Mr-' 
for every T € T and every y £ B such that ||y|| < 1. 

29.7.6. Theorem (Banach-Steinhaus). Let B be a Banach space, W be a normed linear space, and 
(Tn) be a sequence of bounded linear maps from B into W . If the pointwise limit lim^^oo TnX exists 
for every x in B, then the map S : B ^ W : x i— )• lim„_!>oo TnX is a bounded linear transformation. 

29.7.7. Definition. A subset j4 of a normed linear space V is weakly bounded (tt;-bounded) if 
f'^{A) is a bounded subset of the scalar field of V for every f £ V* . 

29.7.8. Proposition. A subset of a normed linear space is bounded if and only if it is weakly 
bounded. 

29.7.9. Exercise. Your good friend Fred R. Dimm needs help again. This time he is worried 
about the assertion (see 29.7.8) that a subset of a normed linear space is bounded if and only if it is 
weakly bounded. He is considering the sequence (a„) in the Hilbert space P defined by a„ = -^/ne" 
for each n S N, where {e*^: re G N} is the usual orthonormal basis for I2 (see example 28.1.16). He 
sees that it is obviously not bounded in the usual (norm) topology on P. But it looks to him as if it 
is weakly bounded. He argues that there is no sequence x £ P such that the set {|(ari,x)| : n E N} 
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is unbounded, because such a sequence would have to decrease more slowly than the sequence 
(n^^/^), which already decreases too slowly to belong to P. Put Fred's mind to rest by finding him 
a suitable sequence. 

29.7.10. Definition. A sequence in a normed linear space V is WEAKLY Cauchy if the 
sequence {f{xn)) is Cauchy for every / in V* . The sequence CONVERGES WEAKLY to a point a 

if /(.T„) f{a) for every f £ V* (that is, if it converges in the weak topology induced by the 

members of V*). In this case we write Xn — ^^a. 

29.7.11. Proposition. In a normed linear space every weakly Cauchy sequence is bounded. 

29.7.12. Proposition. Let E he an orthonormal basis for a Hilhert space H and [xn) be a sequence 
m H. Then x^^^O if and only if (.t„) is bounded and (.t„, e) — t- for every e £ £ . 

29.7.13. Definition. Let H and K be Hilbert spaces. A sequence (r„) in ^{H,K) CONVERGES 
WEAKLY (or CONVERGES IN THE WEAK OPERATOR TOPOLOGY) to i? G ^{H,K) if 

{TnX, y) {Bx, y) 

for every x G H and every y £ K. In this case we write r„ — > B. 

A sequence (r„) in ^(H,K) converges strongly (or converges in the strong opera- 
tor topology) to B g ^{H,K) if 

TnX — >■ Bx 
SOT 

for every x £ H. In this case we write T„ B. 

A family T C ^{H,K) is WEAKLY BOUNDED (or BOUNDED in the weak operator TOPOL- 
OGY) if for every x £ H and every y £ K there exists a positive constant ax^y such that 

\{Tx,y)\< ax,y 

for every T G T. 

29.7.14. Proposition. Let H and K be Hilbert spaces, (Tn) be a sequence in ^{H,K), and 
B £ ^{H,K). then the following implications hold: 

T. JD V SOT j-^ ^ rp WOT 

29.7.15. Proposition. Let (Sn) and (Tn) be sequences of operators on a Hilbert space H. If 

Sn — > A and Tn — B, then SnTn — AB. 

Hint for proof . You may find it helpful to prove first that every weakly convergent sequence of 
Hilbert space operators is bounded. 

SOT 

29.7.16. Example. In the preceding proposition the hypothesis r„ ^B cannot be replaced 

, rj, WOT n 

by Tn ^ B. 
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30.1. The Spectrum 

In this chapter we wih, for the first time, assume, unless otherwise stated, that vector spaces 
and algebras have complex, rather than real, scalars. A good part of the reason for this is the 
necessity of using Liouville's theorem 30.1.7 to prove proposition 30.1.8. 

30.1.1. Proposition. Let a be an element of a unital algebra such that a} = 1. Then either 

(i) a = 1, in which case ^{a) = {1}, or 

(ii) a = —1, in which case a{a) = {—1}, or 

(iii) a(a) = {-l,l}. 

Hint for proof . In (iii) to prove a(a) C { — 1,1}, consider -rfa + Al). 

1 — 

30.1.2. Proposition. Recall that an element a of an algebra is idempotent i/a^ = a. Let a be 
an idempotent element of a unital algebra. Then either 

(i) a = 1, in which case ^{a) = {1}, or 

(ii) a = 0, in which case cr(a) = {0}, or 

(iii) a(a) = {0,l}. 

Hint for proof . In (iii) to prove a{a) C {0, 1}, consider (a + (A — l)l) . 

A — A 

30.1.3. Proposition. Let a be an element of a unital Banach algebra A. Then the spectrum of a 
is compact and |A| < ||a|| for every A G fT(a). 

Hint for proof . Use the Heine-Borel theorem. To prove that the spectrum is closed notice that 
{a{a)Y = /^(inv^) where /(A) = a — Al for every complex number A. Also show that if |A| > ||a||, 
then 1 — A~^a is invertible. 

30.1.4. Definition. Let a be an element of a unital Banach algebra. The resolvent mapping 
for a is defined by 

Ra-. C\a{a) ^ A: (a - Al)"^ 

o 

30.1.5. Definition. Let [/ C C and ^4 be a unital Banach algebra. A function /: [/ — t- A is 
ANALYTIC on U if 

/(a) := hm M^M. 
z^a z — a 

exists for every a G [/. A complex valued function which is analytic on all of C is an entire 
function. 

30.1.6. Proposition. For a an element of a unital Banach algebra A and (p a bounded linear 
functional on A let f := cj) o R^^ : C \ a{a) — >■ C. Then 

(i) / is analytic on its domain, and 

(ii) /(A) ^ as |A| ^ oo. 

Hint for proof . For (i) notice that in a unital algebra — = a~^{b — a)b~^. 

30.1.7. Theorem (Liouville). Every bounded entire function on C is constant. 
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Comment on proof . The proof of this theorem requires a Uttle background in the theory of 
analytic functions, material that is not covered in these notes. The theorem would surely be 
covered in any first course in complex variables. You can find a nice proof in [37], theorem 10.23. 

30.1.8. Proposition. The spectrum of every element of a unital Banach algebra is nonempty. 

Hint for proof . Argue by contradiction. Use Liouville's theorem 30.1.7 to show that (f) o is 
constant for every bounded linear functional (p on A. Then use (corollary 29.1.7 to) the Hahn- 
Banach theorem to prove that Ra is constant. Why must this constant be 0? 



Note. It is important to keep in mind that we are working only with complex alge 
is false for real Banach algebras. An easy counterexample is the matrix ' ^ 



jras. This result 
regarded as an 



1 

element of the (real) Banach algebra M2 of all 2 x 2 matrices of real numbers. 

30.1.9. Theorem (Gelfand-Mazur). If A is a unital Banach algebra in which every nonzero ele- 
ment is invertible, then A is isometrically isomorphic to C. 

Hint for proof . Let B = {Al: A G C}. Use the preceding proposition 30.1.8) to show that 
B = A. 

The following is an immediate consequence of the Gelfand-Mazur theorem 30.1.9 and proposi- 
tion 21.6.4. 

30.1.10. Corollary. If I is a maximal ideal in a commutative unital Banach algebra A, then A/ 1 
is isometrically isomorphic to C. 

30.1.11. Proposition. Let a be an element of a unital algebra. Then 

a{a-) = [a{a)r 

for every n G N. (The notation [cj(a)]" means {A": A G cr{a)}.) 

30.1.12. Definition. Let a be an element of a unital algebra. The spectral radius of a, denoted 
by p{a), is defined to be sup{|A| : A G o"(a)}. 

30.1.13. Theorem (Spectral radius formula). If a is an element of a unital Banach algebra, then 

p{a) =inf{||a"f/": n G N| = lim ||a"f/". 

n— !>oo 

30.1.14. Proposition. Let a be an invertible element of a unital Banach algebra. Then a complex 
number A belongs to the spectrum of a if and only ifl/X belongs to the spectrum of a~^. 

30.1.15. Proposition. Let A be a unital Banach algebra and B a closed subalgebra containing 1a- 
Then 

(1) inv B is both open and closed in B Ci inv^; 

(2) (Ta(6) C asib) for every b G B; and 

(3) if b £ B and aAib) has no holes (that is, if its complement in C is connected), then 
(ja(6) = crsib). 

30.2. Characters 

30.2.1. Definition. A character (or nonzero multiplicative linear functional) on an 
algebra A is a nonzero homomorphism from A into C. The set of all characters on A is denoted 
by AA. 

30.2.2. Proposition. Let A be a unital algebra and cj) G A^. Then 

(a) 4>W = 1; 

(b) if a £ inY A, then (j){a) / 0; 

(c) if a is nilpotent (that is, if a" = for some n gN), then 4>{a) = 0; 
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(d) if a is idempotent, then (p{a) is or 1; and 

(e) (j){a) G a{a) for every a G A. 

Why is part (e) of the preceding proposition not an easy way to show that spectrum of an 
element of a unital algebra is nonempty? 



30.2.3. Example. The algebra M2(C) of 2 x 2 matrices of complex numbers has no characters. 
Hint for proof . Square the matrix 



1 

1 



30.2.4. Exercise. Let A be the algebra of all 2 x 2 matrices [aij] of complex numbers whose upper 
right entry ai2 is zero. Find the set of characters and the set of maximal ideals of A. 



Hint for solution. Notice that 

a O" 
b c 

Use the preceding proposition. 



1 
1 



+ b 





1 




1 



30.2.5. Proposition. Let A be a unital algebra and (j) be a linear functional on A. Then 4> S AA 
if and only if]ieT(j) is closed under multiplication and = 1. 

Hint for proof . For the converse apply (p to the product of a — (j){a)l and b — (j){b)l for a, b G A. 

30.2.6. Example. The only character on the Banach algebra C is the identity map on C 

30.2.7. Proposition. Every multiplicative linear functional on a unital Banach algebra A is con- 
tinuous. In fact, if (j) G A(A), then \\(j)\\ = 1. 



30.2.8. Proposition. Let X be a topological space and x G X. Then the evaluation functional 
at X 

Ex{x): C{X,C)^C: f^ fix) 

is a character on C{X,C). 



30.2.9. Definition. Let A be a unital Banach algebra. We have shown that \\<p\\ = 1 for every 
character (p on A. That is, the set of characters is contained in the unit sphere 5'i(0) of the 
dual A* of the Banach algebra A. Thus we may (and do) regard A A as possessing the relative 
u;*-topology inherited from A*. This is the Gelfand topology on AA. The topological space 
which results from endowing the set AA with the Gelfand topology is called the character space 
or the CARRIER SPACE or (for reasons that will soon become apparent) the maximal ideal space 
of ^. 

30.2.10. Proposition. The character space of a unital Banach algebra is a compact Hausdorff 
space. 

Hint for proof . Use Alaoglu's theorem 29.2.7. 

30.2.11. Proposition. If (p G AA where A is a unital algebra, then keicj) is a maximal ideal in A. 

Hint for proof . To show maximality, suppose / is an ideal in A which properly contains ker (/>. 
Choose z £ I\ker(/). Consider the element x — [(j){x) / (p{z)) z where x is an arbitrary element of A. 

30.2.12. Proposition. A character on a unital algebra is completely determined by its kernel. 

Hint for proof . Let a be an element of the algebra and (phe a character. For how many complex 
numbers A can — Xa belong to the kernel of (p7 



30.2.13. Corollary. If A is a unital algebra, then the map 
injective. 



I— )■ kercp from A A to Max^ is 
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30.2.14. Exercise. Let I be a maximal ideal in a commutative miital Banach algebra A. Then 
there exists a character on A whose kernel is /. 

Hint for proof . Why can we think of the quotient map as a character? 

30.2.15. Corollary. If A is a unital algebra, then the map (f) i— )■ ker</) is a bijection from AA 
onto Max A. 

30.2.16. Definition. Let A be a unital commutative Banach algebra. In light of the preceding 
corollary we can give Max A a topology under which it is homeomorphic to the character space AA. 
This is the maximal ideal space of A. Since A A and Max^ are homeomorphic it is common 
practice to identify them. 

30.2.17. Proposition. Let X be a compact Hausdorff space. Then the evaluation map on X 

Ex: X ^ AC{X, C):x^ Ex{x) 

is a homeomorphism. Thus 

X ^ AC{X,C) ~ MaxC(X,C) . 

30.2.18. Proposition. Let A and B be unital commutative Banach algebras and T: A ^ B be an 
algebra homomorphism. Define AT on AB by 

AT(V') = '4)oT 

for all ip G AB. Then 

(a) AT maps AB into AA. 

(b) The map AT : AB — )• is continuous. 

(c) IfT is surjective, then AT is injective. 

(d) If T is an (algebra) isomorphism, then AT is a homeomorphism. 

(e) The pair of maps A i— t- A^ and T i— >■ AT is a contravariant functor from the category 
of unital commutative Banach algebras and algebra homomorphisms to the category of 
compact Hausdorff spaces and continuous maps. 

(f) If A and B are (algebraically) isomorphic, then AA and AB are homeomorphic. 

30.2.19. Proposition. Let X and Y be compact Hausdorff spaces. IfC{X,C) and C{Y,C) are 
algebraically isomorphic, then X and Y are homeomorphic. 

30.2.20. Corollary. Two compact Hausdorff spaces are homeomorphic if and only if their algebras 
of continuous complex valued functions are isomorphic. 

30.2.21. Corollary. Let X andY be compact Hausdorff spaces. IfC{X,C) and C{Y,C) are alge- 
braically isomorphic, then they are isometrically isomorphic. 

30.2.22. Example. In example 24.3.10 it was pointed out that the Banach space Li(M, A) can be 
made into a Banach algebra using convolution as "multiplication". The sequence space is a 
Banach space. If we replace M by Z and Lebesgue measure A with counting measure, we then make 
li (Z) into a Banach algebra. 

(a) Explain in more elementary terms (that is, without explicitly mentioning the word "mea- 
sure") how to make the Banach space ^i(Z) into a commutative Banach algebra. Verify 
your assertions. 

(b) For each n E Z let e" be the sequence in /i(Z) that is 1 in the n^^ coordinate and 
elsewhere. What is e" * e'"? Is Zi(Z) a unital algebra? 

(c) The maximal ideal space of the Banach algebra ^i(Z) is the unit circle T. 

Hint for proof . To show that T ~ A/i(Z) define 

oo 

V'z: /i(Z) C: a i-> ^ auz^ . 

k=—oo 
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Show that ipz £ A/i(Z). Then show that the map 

ip-.T-^ A/i(Z): z^ip^ 

is a homeomorphism. 

Next we wiU fulfill a promise made in example 25.3.5 and show that the range of the evaluation 
map is naturally homeomorphic to X itself whenever X is a compact Hausdorff space. 

30.2.23. Theorem. The mapping Ex which takes compact Hausdorff spaces to their correspond- 
ing evaluation maps is a natural equivalence between the identity functor and the functor AC in 
the category of compact Hausdorff spaces and continuous maps. That is, the following diagram 
commutes and each vertical arrow is a homeomorphism. 



X 



Y 



Ex 



Ey 



Y Y 

AC{X) AC(Y) 

AC{ct>) 

(Note: in the preceding diagram C{X) and C{Y) are algebras of complex valued continuous func- 
tions.) 

30.2.24. Remark. Thus a compact Hausdorff space X and its character space AC{X) are not 
only homeomorphic, they are naturally homeomorphic. This provides strong justification for the 
(very common) informal assertion that the maximal ideals of the Banach algebra C{X) "are" just 
the points of X. 

30.3. C*-aIgebras 

30.3.1. Definition. An involution on an algebra yl is a map x ^ x* from A into A which 
satisfies 

(i) {x + y)* = X* +y\ 
(Si) [ax)* =ax*, 

(iii) x** = X, and 

(iv) (xy)* = y*x* 

for all x,y £ A and a G E. An algebra on which an involution has been defined is a *-ALGEBRA 
(pronounced "star algebra"). An algebra homomorphism (p between * -algebras which preserves 
involution (that is, such that 4>{a*) = (0(a))*) is a *-homomorphism (pronounced "star homomor- 
phism" . A * -homomorphism (j): A ^ B between unital algebras is said to be unital if (j){'^A) = 1_B- 

30.3.2. Example. Complex conjugation is an involution on the algebra C of complex numbers. 

30.3.3. Example. Let X be a topological space. Then complex conjugation is an involution on 
the algebra C(X, C) of complex valued functions on X. 

30.3.4. Example. On the algebra M„(C) of n x n matrices of complex numbers the operation of 
conjugate transpose is an involution. (The conjugate transpose of the matrix [aij] is the matrix 
\bij\ where hij = ajl for alH, j S N„.) 

30.3.5. Example. On the algebra ^(H) of operators on a Hilbert space the operation T i— )• T* of 
taking adjoints is an involution. 

30.3.6. Proposition. Let a and b be elements of a * -algebra. Then a commutes with b if and only 
if a* commutes with b* . 
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30.3.7. Proposition. In a unital * -algebra 1* = 1. 

30.3.8. Proposition. Let a be an element of a unital * -algebra. Then a* is invertible if and only 
if a is. And when a is invertible we have 

30.3.9. Proposition. Let a be an element of a unital * -algebra. Then A G o"(a) if and only if 
Xea{a*). 

30.3.10. Definition. An element a of a *-algebra A is self- AD joint (or hermitian) if a* = a. It 
is NORMAL if a*a = aa* . And it is unitary if a*a = aa* = 1. The set of all self-adjoint elements of 
A is denoted by T-L{A), the set of all normal elements by M{A), and the set of all unitary elements 
hyU{A). 

30.3.11. Proposition. Let a be an element of a * -algebra. Then there exist unique self-adjoint 
elements u and v such that a = u + iv. 

Hint for proof . Think of the algebra elements as being "like" complex numbers and self-adjoint 
elements as being "like" real numbers. 

30.3.12. Definition. A C*-algebra is a Banach algebra A with involution which satisfies 

II * II II Il2 

||o a\\ = \\a\\ 

for every a ^ A. 

30.3.13. Example. The set C of complex numbers is a C*-algebra. (Use the usual algebraic 
operations, absolute value for norm, and complex conjugation for involution.) 

30.3.14. Example. Let X be a nonempty locally compact Hausdorff space. Then, under the 
usual pointwise operations, uniform norm, and complex conjugation for involution, Cq{X,C) is a 
commutative C*-algebra, which does not necessarily have an identity. If X is compact, then the 
C*-algebra Co(X, C) = C{X, C) is unital. 

30.3.15. Example. Let H he a, Hilbert space. Then ^{H) is a C*-algebra. (Use composition for 
multiplication, the operator norm, and adjoints for involution. See proposition 28.2.14.) 

30.3.16. Proposition. In a C* -algebra involution is an isometry. That is, \\a*\\ = \\a\\ for every 
element a of the algebra. 

30.3.17. Proposition. In a unital C* -algebra ||1|| = 1. 

30.3.18. Proposition. If a is a normal element of a unital C* -algebra, then ||a^|| = and 
consequently p[a) = \\a\\. 

30.3.19. Corollary. There is at most one norm on a *-algebra under which it is a C* -algebra. 

30.3.20. Proposition. A unital * -homomorphism (j): A ^ B between unital C* -algebras takes 
invertible elements of A to invertible elements of B. Thus, (j{(j){a)) C a{a) for every a & A. 

30.3.21. Proposition. Every unital * -homomorphism between unital C* -algebras is contractive 
(and therefore continuous). 

30.4. The Gelfand-Naimark Theorem 

30.4.1. Definition. Let A be a commutative unital Banach algebra and a G A. We define 
r^a: — )• C by r^a(r) = r(a) for every r G A(A). (Alternative notations: we frequently write 
Ta or a for I^a.) Since /\A C A* it is clear that I^a is just the restriction of a** to the character 
space of A. Recall that the Gelfand topology on AA is the relative tt;*-topology; that is, the weakest 
topology under which o** is continuous on A A for each a & A. Thus I^a is a continuous function on 
AA. The map T^: A ^ C(AA, C) is the Gelfand transform of A. (The element F^a = Ta = a 
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is called the Gelfand transform of a — because trying repeatedly to say the Gelfand transform of A 
evaluated at a is unpleasant.) Since AA is a compact Hausdorff space C{AA, C) is a unital Banach 
algebra. 

30.4.2. Proposition. If A is a unital commutative Banach algebra, then 1^ : A — )• C{AA,C) is a 
contractive algebra homomorphism having norm one. Furthermore 1^1^ is the constant function 
1 on AA and the range ofT^ is a separating subalgebra of C{AA,C) . 

30.4.3. Proposition. Let a be an element of a commutative unital Banach algebra. Then a is 
invertible in A if and only if a is invertible in C{AA, C). 

30.4.4. Proposition. Let a be an element of a commutative unital Banach algebra A. Then the 
following are equivalent: 

(a) limlla'^f/" = 0; 

(b) p(a) = 0; 

(c) a{a) = {0}; 

(d) r^a = 0; 

(e) T(a) = for every r G A^l; 

(f) a ef]MaxA. 

30.4.5. Proposition. Let A be a commutative unital Banach algebra. Then the following are 
equivalent: 

(a) «/lim ||a"f = 0, then a = 0; 

(b) if p{a) = 0, then a = 0; 

(c) if a {a) = {0}, then a = 0; 

(d) Tj^ is a monomorphism; 

(e) if T{a) = for every r G A^, then a = 0; 

(f) f]MaxA = {0}. 

30.4.6. Proposition. Let X be a compact Hausdorff space. Then the Gelfand transform on the 
Banach algebra C{X,C) is an isometric isomorphism. 

Hint for proof. Let A = C{X, C). Since the evaluation map E: X ^ AA on X is a homeo- 
morphism, it follows that the induced map CE: C{AA,C) — )• A is an isometric isomorphism. Show 
that r is simply (CE)^^. 

30.4.7. Proposition. If a is an element of a commutative unital Banach algebra, then ran a = cr(a) 
and \\a\\u = p{a). 

30.4.8. Example. Let be a positive measure space. The Gelfand transform on the unital 
commutative Banach algebra Loo(5', C) is an isometric isomorphism. 

Hint for proof. Use example 18.4.13 together with propositions 30.4.7 and 18.4.8 to show 
that ||r/||„ = ll/lloo- From this and proposition 30.4.2 it is easy to see that the range of F is a 
closed separating unital subalgebra of C(A(Loo(5', C))) . To see that this subalgebra is self-adjoint 
use proposition 30.4.7 and example 18.4.13. Then the conclusion is immediate from the Stone- 
Weierstrass theorem 21.2.14. 

Next we examine some of the peculiarities associated with the rather pathological maximal 
ideal space of the unital commutative Banach algebra Lqo ([0, 1], C) . 

30.4.9. Example. Let A = Loo ([0, 1], C) . The maximal ideal space A^ is made into a probability 
space in a standard fashion. Since 



F: A^CiAA): f^Tf 
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is an isometric isomorphism (see 30.4.8), the map 

(t>: C{AA) ^ C: r/ ^ [ f dX 

Jo 

is weU-defined and is a bounded Hnear functional with ||(/>|| = 1. By the Riesz representation 
theorem (25.4.7) there exists a unique regular Borel measure /i on such that ||/i|| = 1 and 

/ (r/)d/i= [' fdX 

Jaa Jo 

for all Tf in C{AA). 

30.4.10. Lemma. As in the preceding example let A = Lqo ([0, 1], C) and regard AA as a probability 
space. Every nonempty open subset of AA has strictly positive measure. 

30.4.11. Example. Let A = Loo ([0, 1], C) . If / is a complex valued Borel measurable function 
on the maximal ideal space AA, then there exists a continuous function h on AA such that f = h 
/i-a.e. (where n is the probability measure defined on AA in example 30.4.9). Roughly, we are 
saying that 

Loo(A^,C) =C(AA). (30.1) 
Of course, this cannot be literally true since the objects on the left side of (30.1) are equivalence 
classes of functions whereas the objects on the right are functions. Interpret (30.1) as saying that 
each equivalence class in Loo(AA, C) has a continuous member. Quite a remarkable property! 

The next example may make one feel better about the last one. 

30.4.12. Example. The maximal ideal space of Loo([0, 1],C) is extremally disconnected. 

30.4.13. Proposition. If u is a unitary element in a unital C* -algebra, then (y{u) is contained in 
T, the unit circle in the complex plane. 

30.4.14. Proposition. If h is a self-adjoint element in a commutative unital C* -algebra, then 
a{h) C M. 

30.4.15. Corollary. Every character on a unital commutative C* -algebra is a * -homomorphism. 

30.4.16. Theorem (Gelfand-Naimark Theorem). Let A be a commutative unital C* -algebra. Then 
the Gelfand transform : a is an isometric * -isomorphism of A onto C{AA,C). 

30.5. The Spectral Theorem 

30.5.1. Definition. Let S be a subset of a *-algebra A. Then S* = {s* : s E 5}. The subset 
S is SELF-ADJOINT if S* = S. (Note: this does not say that each element of S is self- adjoint.) 
A nonempty self-adjoint subalgebra of ^4 is a *-subalgebra (or a SUB-*- algebra). Let A be a 
C*-algebra. A nonempty subset of A is a C*-SUBALGEBRA (or a sub-C*-algebra) of A if it is a 
C*-algebra under the algebraic operations and norm it inherits from A. 

30.5.2. Proposition. Let A be a C* -algebra. The closure of a * -subalgebra of A is a C* -subalgebra 
of A. 

30.5.3. Definition. Let 5 be a nonempty subset of a C*-algebra A. The intersection of the family 
of all C*-subalgebras of A which contain S is the C*-SUBALGEBRA generated by S. We denote 
it by C*{S). (It is easy to see that the intersection of a family of C*-subalgebras is a C*-algebra.) 
In some cases we shorten the notation slightly: for example, if a G A we write C*{a) for C*{{a}). 

30.5.4. Proposition. Let S be a nonempty subset of a C* -algebra A. For each natural number n 
define the set Wn to be the set of all elements a of A for which there exist xi, X2, . . . , Xn in S U S* 
such that a = X1X2 ■ ■ ■ Xn- Let W = U^;^VF„. Then 

C*{S) = span(VF). 
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30.5.5. Theorem (Abstract Spectral Theorem). If a is a normal element of a C* -algebra, then 
the C* -algebra C{a(a),C) is isometrically * -isomorphic to C*{l,a). 

Hint for proof . Use the Gelfand transform of a to identify the maximal ideal space of C*(l, a) 
with the spectrum of a. Apply the functor C. Compose the resulting map with where T is the 
Gelfand transform on the C*-algebra C*(l,a). 

30.5.6. Exercise. Suppose that in the preceding theorem -0: C{a{a),C) — )• C*(l,a) implements 
the isometric *-isomorphism. What is the image under ip of the constant function 1 on the spectrum 
of a? What is the image under ■0 of the identity function A i— )• A on the spectrum of a? 

30.5.7. Exercise. Let T be a normal operator on a Hilbert space H whose spectrum is contained 
in [0,oo). Suppose that : C{a{T),C) — )■ C*(1,T) implements the isometric *-isomorphism between 
these two C*-algebras. By \/T we presumably mean an operator on H whose square is T. How do 
we know that there is at least one such operator? In case there are several such operators explain 
how the abstract spectral theorem singles out one of these. If / is a continuous function on the 
spectrum of T explain what we mean by the expression f{T). 

30.5.8. Definition. A projection operator on a Hilbert space is an orthogonal projection if 
its range is orthogonal to its kernel. 

In 29.5.5 we defined projections on a Banach space B to be the idempotents in 55(5). We now 
characterize the orthogonal projections on a Hilbert space H as the self-adjoint idempotents in the 
C7* -algebra "SiH). 

30.5.9. Proposition. A projection P : H ^ H on a Hilbert space is an orthogonal projection if 
and only if P = P* . 

30.5.10. Proposition. If N is a normal Hilbert space operator whose spectrum is {0, 1}, then N 
is an orthogonal projection. 

30.5.11. Proposition. The Gelfand transform T is a natural equivalence between the identity func- 
tor and the CA functor on the category of commutative unital C* -algebras and * -homomorphisms. 

30.5.12. Theorem (Spectral Mapping Theorem). If a is a normal element of a C* -algebra and 
f £ C{a{a), C), then 

a{f{a)) = f{a{a)). 



Hint for proof . Use the abstract spectral theorem 30.5.5. The proof should be very short. 
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everywhere, 136 
analytic, 223 
annihilator, 29, 214 
anti-isomorphism, 162 
antilinear, 162 
antisymmetric, 8 
approximation theorem, 164 
arc, 200 
Archimedean 

property 

of the real numbers, 44 

Riesz space, 104 

example of a non-, 104 
arcwise connected, 200 
arithmetic progression, 74 
Arzela-Ascoli theorem, 156 
Ascoli-Arzela theorem, 156 
associative, 21 
atom, 136 

a.u. (almost uniformly), 203 
axiom 

of infinity, 45 
axiom of choice, 13 

Ba (band generated by A), 119 
Ba (principal band), 119 
Baire 

category theorem, 217 
space, 217 
BALG 

products and coproducts in, 158 
quotients in, 169, 219 
the category, 158 
ball 
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closed, 75 
open, 75 

Br (a) (open ball of radius r about a), 75 
Cr(a) (closed ball of radius r about a), 75 
BAN 

quotients in, 218 

the category, 156 
Banach algebra, 156 

Mn as a, 156 

B{S) as a, 152, 156 

C{X) as a, 152, 156 

Co{X) as a, 156 

CbiX) as a, 152, 156 

Lcx)(5) as a, 156 

©(F) as a, 156 

direct sum, 158 

homomorphism, 158 
Banach decomposition theorem, 39 
Banach lattice, 163 

ba(5) as a, 163 

ca(5) as a, 163 

B{S) as a, 163 

C{X) as a, 163 

Co{X) as a, 163 

CbiX) as a, 163 

Li(S') as a, 163 

Lcx)(5) as a, 163 

Lp(5') as a, 163 

5im(5') as a, 163 
Banach space, 156 

ba(S') as a, 157 

ca(S') as a, 157 

Li(S') as a, 156 

Lp(S') as a, 157 

'B{V,W) as a, 156 

M(X) as a, 198 

direct sum, 158 
band, 119 

generated by a set, 119 

principal, 119 
ba(5',2l), ba(S) 

as a Banach lattice, 163 

as a Banach space, 157 

as a normed linear space, 91, 125 

as a Riesz space, 102 

as an ordered vector space, 70 

bounded finitely additive set functions, 69 
base 

for a topology, 73 

for neighborhoods of a point, 120 

for the discrete topology, 73 

for the topology of a metric space, 76 

for the usual topology on R, 73 

for topology of pointwise convergence, 85 
basic 

neighborhoods, 120 

open sets, 73 
basis 

Hamel, 36 

Hilbert space, 208 



orthonormal, 208 

vectors 
for R^ 27 
Bayes' 

box paradox, 183 

law, 183 
Bernoulli's inequality, 49 
Bessel's inequality, 208 

/3(X ) (the Stone-Cech compactification), 132 
bijection, 12 
bijective, 12 
bijective morphisms 
are invertible 
in AbGp, 24 
in ALG, 31 
in BANi, 218 
in CpH, 128 
in LAT, 61 
in RS, 105 
in SET, 61 
in VEC, 27 
need not be invertible 
in POSET, 61 
in RSP, 105 
in TOP, 82 
binary operation, 21 
binomial 

coefficient, 48 
theorem, 48 
Boolean algebra 
Z2 as a, 40 
q3(5) as a, 40 
definitions, 39, 41 
propositional calculus as a, 40 
trivial, 40 
Boolean ring, 30 
Borel 

measurable function, 111 
measure, 194 

inner regular, 194 

outer regular, 194 

regular, 194 
sets, 110 

<Bor{X) (family of Borel subsets of X), 110 
bound 

greatest lower, 34 

least upper, 34 

lower, 34 

upper, 34 
boundary, 80 
bounded 

above, 34, 51 

away from zero, 173, 219 

below, 34, 51, 219 

essentially, 140 

finitely additive set function, 69 
function into a metric space, 78 
linear functionals, 96 
linear map 
order, 106 
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linear transformation, 93 
metric, 78 
net in R, 121 
order, 106 
pointwise, 155, 221 
real valued function, 26 
sequence, 51, 78 
sesquilinear functional, 209 
subset of a metric space, 78 
totally, 152 
uniformly, 155, 221 
variation, 172 
weakly, 222 

in a normed linear space, 221 

as a Banach algebra, 152, 156 

as a Banach lattice, 163 

as a normed algebra, 95 

as a normed linear space, 90 

as a Riesz space, 101 

as a unital algebra, 30 

as a vector space, 26 

set of bounded real valued functions on S, 26 
B{S, M) 

as a metric space, 78 

set of bounded functions into a metric space, 78 
B{S, V) 

as a normed linear space, 90 

as a Banach algebra, 156 

as a unital normed algebra, 95 

operators, 93 

'&{y,w) 

as a Banach space, 156 
as a normed linear space, 94 
bounded linear maps, 93 
boundedness 

principle of uniform, 221 

BV{[aM) 

as a normed linear space, 173 
as a Riesz space 

with positive functions as positive cone, 173 
as an algebra, 173 

functions of bounded variation on [a, b\, 172 
eVo([a,6]) 

as a normed linear space, 174 
as a Riesz space 

with increasing functions as positive cone, 174 
with positive functions as positive cone, 174 
functions of bounded variation with /(a) = 0, 174 
[a, b] (order interval in an ordered vector space, 104 
Bunyakovskii inequality, 48 

C*-algebra, 228 
C*-subalgebra, 230 

C*-subalgebra generated by a set, 230 
C (set of complex numbers), 5 
cancellable 

left, 29, 60 

right, 29, 60 
cancellation property, 29 



Cantor 

-Bendixson theorem, 130 

-Schroder-Bernstein theorem, 39 

-like set, 130 

function, 130 

set, 129 

has measure zero, 139 
Caratheodory's theorem, 138 
cardinal equivalence, 17 
cardinal number, 17 
cardinality, 17, 39 
card S, 17, 39 
carrier space, 225 
Cartesian product, 64 
ca(S',a), ca(S) 

as a Banach lattice, 163 

as a Banach space, 157 

as a band in ba(5'), 119 

as a normed linear space, 125 

as a Riesz space, 117 

as an order ideal in ba(S'), 117 

real measures on S, 115 
category, 59 

ALG as a, 60 

AbGp as a, 59 

BALG as a, 158 

BAN as a, 156 

CpH as a, 127 

HILi as a, 211 

HILoo as a, 211 

NLSoo as a, 92 

NLSi as a, 92 

RSP as a, 105 

RS as a, 104 

SET as a, 59 

TOP as a, 81 

VEC as a, 59 

concrete, 60 

first, 216 

geometric, 92 

second, 216 

topological, 92 
(pairs of objects and morphisms), 113 
Cauchy 

inequality, 48 

sequence, 151 
weak, 222 
chain, 33 

change of variables 

in Riemann-Stieltjes integrals, 178 
change of variables theorem, 195 
character, 224 

space, 225 
characteristic function, 11 
choice 

axiom of, 13 

function, 13 
class 

equivalence, 14 

sequential covering, 136 
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closed 
ball, 75 

unit, 75 
graph theorem, 219 
linear span, 161 
map, 87 
segment, 68 

subset of a topological space, 72 

under an operation, 21 
closure, 79 

characterization by sequences, 80 

characterization of by nets, 122 
cly (closure in Y), 86 
cluster point 

of a net, 119 

of a sequence, 58 
coarser topology, 72 
cocountable, 109 
codomain, 8 
coefficient 

binomial, 48 
cofinite, 42 
cofinite topology, 72 
collection, 1 
combination 

convex, 68 

linear, 35 
commutative 

algebra, 29 

diagram, 10 

operation, 21 
commutative ring, 28 
commute, 10 
compact 

locally, 130 

relatively, 156 

sequentially, 127 

subset, 127 

topological space, 127 
compactification, 131 

Alexandroff, 132 

one-point, 132 

Stone-Cech, 132 
comparable, 33 
compatibility 

of orderings with operations, 67 
complement 

disjoint, 124 

of a Banach subspace, 220 
of a set, 3 

of an element in a Boolean algebra, 40 

orthogonal, 161 

relative, 4 
complemented subspace, 220 
complete 

Dedekind, 34 

measure space, 189 

metric space, 151 

order, 34 

orthonormal set, 208 



product, 64 

cr-Dedekind, 34 
completely regular 

topological space, 132 
completion 

of a metric space, 153 

of a positive measure space, 190 
complex 

algebra, 29 

measure, 115 
space, 115 

vector space, 25 
complex numbers, 49 

absolute value of, 49 

definition of, 49 

distance between, 49 

imaginary part of, 49 

metric on, 49 

real part of, 49 
component, 200 

Ca (the component of a topological space 

containing a), 200 
composite 

function, 9 
composite of measurable functions 

is measurable. 111 
concentrated, 116 
concrete category, 60 
conditional 

probability, 182 
cone, 68 

generating, 68 

positive, 43, 68 

proper, 68 
conjugate 

exponents, 157 

linear, 209 

transpose, 227 
connected, 199 

arcwise, 200 

component, 200 

path, 200 
consistent 

relatively, 44 

set of axioms, 44 
constant 

locally, 189 
constant function, 11 
construction 

Grothendieck, 45 

of N from sets, 45 

of Q from Z, 46 

of Z from N, 46 

of the real numbers, 44 
containment, 2 

proper, 2 

Co{X) (continuous functions vanishing at infinity), 

156 
continuity 
cr-order, 116 
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absolute 

of a function, 125 

of measures, 125 
at a point, 82 

characterization by nets, 122 
characterization by sequences, 83 
of a function, 81 
right-, 171 

sequential order, 116 

uniform, 83 
continuous 

linear functionals, 96 

part of a positive measure, 136 
C (the functor), 113 
C{X) 

as a Banach algebra, 152, 156 
as a Banach lattice, 163 
as a Riesz space, 101 
as a vector space, 95 

continuous real valued functions on X, 81 
dual of, 198 
C{X,Y) 

continuous functions between topological spaces, 81 
Co(X) 

as a Banach algebra, 156 
as a Banach lattice, 163 
dual of, 198 
Cb(X) 

as a Banach algebra, 152, 156 

as a Banach lattice, 163 

as a normed algebra, 95 

as a normed linear space, 95 

continuous bounded functions on X, 95 
continuously differentiable, 220 
contractive, 92, 158 
contravariant functor, 112 
conventions 

—00 as infimum; oo as supremum, 54 

about distinctness of points in a set, 1 

algebra are real, 29 

all operators are bounded and linear, 93 
almost everywhere, unmodified, means with 

respect to Lebesgue measure, 136 
essentially unique means unique up to 

isomorphism, 63 
function = map = transformation, 8 
Hilbert spaces are complex, 159 
notation for composition 

of algebra homomorphisms, 31 

of group homomorphisms, 24 

of linear maps, 26 

of morphisms, 59 
on ±oo as cluster points, 58 
on choice of product norm, 94 
on meaning of "the limit exists" , 55 
on the arithmetic of extended real numbers, 55 
subspaces are closed, 161 
vector spaces are real, 25 
convergence, 207 
absolute, 152 



almost everywhere, 136, 203 

almost uniformly, 203 

dominated, 147 

in mean, 203 

in measure, 203 
is topological, 206 

monotone 

of sequences, 104 

of a sequence in a topological space, 79 

of a sequence of real numbers, 55 

of infinite series, 121, 152 

of nets, 119 

pointwise, 90, 122 
topology of, 85 

uniform, 90 

weak, 222 
convex 

combination, 68 

hull, 68 

set, 68 
convolution 

of functions, 191 
coordinate 

functions, 112 

projections, 64 
coordinatewise ordering, 33 
coproduct, 65, 154 

in BALG, 158 

in SET, 65 

in TOP, 85 

in VEC, 65 

in a category, 65 

of measurable spaces, 187 

uniqueness of, 65 
countable, 19 

second, 74 
countably 

additive, 115 

Dedekind complete, 34 

infinite, 19 

subadditive, 69, 135 
ca(S), ca(5', 21), ca(S', C) (countably additive set 

functions, 115 
counting measure, 116, 135 
covariant functor, 112 
cover, 3, 20 

open, 127 
covering, 3 

class, 136 
covers, 3 
CpH 

the category, 127 
curve, 200 

D (open unit disk), 5 
dancing functions, 203 
De Morgan's laws 

for Boolean algebras, 40 

for sets, 4 
decomposition 

Hahn, 124 
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Jordan, 101 
Lebesgue, 193 
measurable, 143 
Riesz, 103 

theorem of Banach, 39 
decreasing 
function, 171 
net, 122 
sequence, 104 

sequence of real numbers, 57 

sequence of sets, 54 

strictly, 171 
Dedekind 

(j-complete, 34 

complete, 34 
A/a (a translate of /), 98 
A(^) (character space of an algebra), 224 
dense, 80 

nowhere, 215 
denumerable, 19 
dependent 

linearly, 36 
derivative 

Radon-Nikodym, 146, 193 
derived set, 79 
deviation 

standard, 185 
diagonal, 121 

functor, 113 
diagram, 10 

commutative, 10 
diam^ (diameter of a set), 151 
difference 

set, 4 

symmetric, 4 
differentiable, 99 

continuously, 220 
differential, 99 

dfa (the differential of / at a), 99 

dimension, 37 

Dini's theorem, 128 

Dirac measure, 116 

direct 

product, 64 
sum, 65 

external, 162 
internal, 160 
of Banach algebras, 158 
of Banach spaces, 158 
of Hilbert spaces, 162 
of inner product spaces, 162 
of normed linear spaces, 94 
of vector spaces, 64 
orthogonal, 160, 162 
directed set, 119 
disconnected, 199 
extremally, 201 
totally, 201 
discrete 
metric, 75 



part of a positive measure, 136 

probability space, 181 

topology, 72 
disjoint, 2 

complement, 124 

elements of a Riesz space, 123 

family, 2 

mutually, 2 

pairwise, 2, 53 

union, 3, 65 
topology, 85 
disk, 20 
distance 

between complex numbers, 49 

between sets, 75 

function, 75 
d{A,B) (distance between sets A and B), 75 
d{x,y) (distance between points x and y), 75 
distributes, 21 
distribution 

function 

of a random variable, 184 

probability, 184 
distributive 

lattice, 38 

lattice, ^P(S') as a, 38 

laws, 2, 21 
divergence 

of a sequence of real numbers, 55 
divides, 38 
divisor 

of zero, 29 
domain, 8 

dom / (domain of a function /) , 8 

dominate, 34, 105, 205 

dominated convergence theorem, 147 

di (taxicab metric, product metric), 76 

doohickey, smallest, 35 

du (uniform metric), 76 

dual 

algebraic, 28 

norm, 96 

of C(X), 198 

of Co(X), 198 

of Lp(S'), 197 

of a partially ordered set, 103 

order, 106 

second, 195 

topological, 96 
(order dual of a Riesz space), 106 
V* (dual space of a normed linear space), 96 

(algebraic dual of a vector space), 28 
duality, 103 

functor, 113 
duplication property, 164 
Dynkin system, 110 

Egoroff 's theorem, 204 
element, 1 
embedding 
algebraic, 45 
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topological, 131 
empty set, 2 

(empty set, null set), 2 
entire, 223 
enumeration, 19 
epic, 60 

epimorphism, 60 

e-neighborhood (of a point in R), 57 
e-net, 152 
equality, 2 

by definition, 2 
equally likely outcomes, 181 
equicontinuity 

at a point, 155 

on a set, 155 

uniform, 155 
equivalence 

class, 14 

natural, 195 

relation, 14 
equivalent 

cardinally, 17 

metrics, 77 

norms, 92 

induce identical topologies, 92 
essential 

range, 141 

supremum, 140 
norm, 141 
essentially 

bounded, 140 

unique, 63 
essran/ (essential range of /), 141 
Euclidean 

metric, 76 

norm 

on R", 89 

on the product of two normed spaces, 94 
evaluation 

functional, 28 

map, 196 

maps, 64 
events, 181 

independent, 181 
eventually, 52, 119 
exclusive 

mutually, 181 
exhaustive, 181 
expansion 

ternary, 130 
expectation, 184 
exponents 

conjugate, 157 
extended real numbers, 55 

topology on, 73 
extension, 11 
external 

direct sum, 162 
extremally disconnected, 201 
extreme value theorem, 128 



factor 

group, 61 

vector space, 62 
factorial, 48 
family, 1 

indexed, 52 
Fatou's lemma, 146 
_F-hereditary, 86 

compactness is, 127 
field, 29 

of sets, 41 

properties of R, 43 
finer topology, 72 
finite, 17 

additivity, 69 

dimensional, 37 

intersection property, 127 
and compactness, 127 

variance, 185 

word, 20 

FinS (family of finite subsets of a set S), 85 
first 

category, 216 
first countable, 201 
fixed point, 39 

theorem of Knaster-Tarski, 39 
for (sufficiently) large n, 52 
forgetful functor, 153 
Fourier 

expansion, 208 
Fraktur fonts, x 
frequently, 52, 119 

Fcr subset (of a topological space), 110 
Fubini's theorem, 191 
function, 8 

strictly decreasing, 171 

strictly increasing, 171 

Borel measurable. 111 

Cantor, 130 

characteristic, 11 

choice, 13 

constant, 11 

coordinate, 112 

decreasing, 171 

distribution 

of a random variable, 184 

evaluation at a point, 64 

increasing, 171 

Lebesgue measurable, 140 

Lebesgue singular, 175 

measurable. 111 

monotone, 171 

of a real variable, 8 

of bounded variation, 172 

positive, 69 

simple, 143 

standard form for, 143 

singular, 175 

strictly monotone, 171 

total variation of, 172 
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functional 

bounded linear, 96 

bounded sesquilinear, 209 

continuous linear, 96 

evaluation, 28 

linear, 28 

sesquilinear, 209 
functionally separated, 131 
T{S, T) (set of functions from S to T), 8 
HS) 

as a commutative unital algebra, 29 

as a lattice, 37 

as a partially ordered set, 33 

as a Riesz space, 101 

as a vector space, 26 

as an ordered vector space, 67 

set of real valued functions on S, 8 

Ta{V,W) 

functions in a neighborhood of a, 97 
functor 

adjoint, 211 

contravariant, 112 

covariant, 112 

diagonal, 113 

duality, 113 

forgetful, 153 

second dual 

for Banach spaces, 195 

for finite dimensional vector spaces, 195 

the natural injection as a, 196 

the natural map as a, 195 
fundamental quotient theorem, see quotient theorem, 

fundamental 
fundamental theorem of calculus, 176 

gauge, 137 

Gs subset (of a topological space), 110 
Gelfand 

topology, 225 

transform, 228 
Gelfand-Mazur theorem, 224 
Gelfand-Naimark theorem, 230 
generalized Stone- Weierstrass theorem, 165 
generating cone, 68 
geometric category, 92 
G-hereditary, 86 

the Baire property is, 217 
good sets principle, 110 
Gram-Schmidt orthonormalization, 208 
graph, 8 
greatest 

element, 34 
greatest lower bound, 34 
Greek letters, ix 
Grothendieck 

group, 45 

map, 46 
Grothendieck construction, 45 
group, 22 

Abelian, 22 
factor, 61 



quotient, 61 
Grothendieck, 45 

Holder's inequality, 157 
Hahn 

decomposition, 124 
Hahn-Banach theorem, 213 
Hahn-Tong-Katetov theorem, 167 
Hamel basis, 36 
Hausdorff, 74 
Heine-Borel theorem, 129 
hereditary, 86 
hermitian, 228 

H{A) (hermitian elements in a *-algebra, 228 
HILi 

the category, 211 
HILoo 

the category, 211 
Hilbert space, 159 

LiiS) as a, 160 

h as a, 159 

basis for, 208 

direct sum, 162 
Hom(G), 23 

as a unital ring, 28 
Hom(G, H) 

(semi) group homomorphisms) , 23 

as an Abelian group, 24 
homeomorphism, 81 
homomorphism 

group, 23 

lattice, 39 

monoid, 24 

of algebras, 31 
unital, 31 

of Banach algebras, 158 

of rings, 30 
unital, 30 

Riesz, 104 

semigroup, 23 
horizontal section, 187 
hull, convex, 68 

ideal 

inC(X), 169 

in an algebra, 30 

left, 30 

maximal, 30 

order, 117 

generated by a set, 118 

principal, 30 

principal order, 118 

proper, 30 

right, 30 
idempotent, 30 
identity 

element, 22 

function, 9 

I, Is, ids, id (notations for the identity function), 9 
image 
inverse 
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of a set, 9 

of a function, 9 

of a point, 8 

of a set, 9 
imaginary 

part of a complex number, 49 
inclusion map, 10 

s (inclusion map of A into S) , 10 
increasing 

function, 171 

net, 122 

net in R, 121 

sequence, 104 

sequence of real numbers, 57 
sequence of sets, 54 
strictly, 171 
M[a,b]) 

increasing functions with /(a) = 0, 174 
indefinite integral, 176 
independent 

events, 181 

linearly, 36 
index set, 52 
indexed family, 52 
indiscrete topology, 72 
induced 

measure, 139 

partial ordering, 68 

topology, 73 

topology (by a metric), 77 
inductive, 44 
inequality 

Bernoulli's, 49 

Bunyakovskii, 48 

Cauchy, 48 

Minkowski, 48 

Schwarz, 48, 160 

strict, 56 

triangle, 47, 75 

weak, 56 
inf (infimum of a set), 34 
infimum, 34 
infinite, 17 

countably, 19 

dimensional, 37 

word, 20 
infinite series, 152 

convergence of an, 121, 152 

partial sum of an, 121, 152 

sum of an, 121, 152 
infinity 

as an extended real number, 55 

axiom of, 45 

point at, 132 

vanish at, 156 
injection, 11 
injective, 11 
inner 

product, 159 

regular 



Borel measure, 194 
inner product space 

direct sum, 162 
input space, 8 
integrable, 147 
Riemann, 120 
Riemann-Stieltjes, 178 
integral 

indefinite, 176 
Lebesgue, 147 

vs. Riemann integral, 149 
of a positive function, 145 
of a simple function, 144, 145 
operator, 210 

adjoint of an, 210 
kernel of a, 210 
Riemann, 120, 178 
Riemann-Stieltjes, 178 

with respect to a finitely additive set function, 144, 
145 

with respect to a positive measure, 145 
I f = Is-f'^^^ (integral of /) 

if / is simple, 144 
j f da (Riemann-Stieltjes integral of / with respect 

to a, 178 
integrand, 178 
integrator function, 178 
interior 

characterization by sequences, 79 
characterization of by nets, 122 
of a set, 79 
point, 79 

inty (interior with respect to Y), 86 
intermediate value theorem, 199 
internal 

orthogonal direct sum, 160 
intersection, 2 
interval 

order, 104 
intrinsic property, 85 

being nowhere dense is not an, 216 

compactness is an, 127 
invariant 

under translations, 139 
inverse 

image 
of a set, 9 

left, 12, 22, 60 

of a function, 12 

of a morphism, 60 

of an element with respect to an operation, 22 

right, 12, 22, 60 
invertible, 12, 22, 60 

conditions for a function to be, 13 

function, 13 
involution, 39, 227 
isolated point, 130 
isometric 

isomorphism, 92 
isomorphic, 24, 30, 31 
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isomorphism 

in a category, 60 
isometric, 92 
of algebras, 31 
of rings, 30 
of semigroups, 24 
order, 35 

J A (order ideal generated by A), 118 

Ja (principal order ideal), 118 

Jc (continuous functions vanishing on C), 169 

join, 38 

Jordan decomposition theorem, 101 

Katetov theorem, 167 

kernel, 27 

of a linear map, 27 

of a ring homomorphism, 30 

of an Abelian group homomorphism, 25 

of an algebra homomorphism, 31 

of an integral operator, 210 

kerT (kernel of a linear map), 27 

Knaster-Tarski fixed point theorem, 39 

L-infinity norm, 141 
h 

as a Hilbert space, 159 
A (Lebesgue measure on R), 139 
large, 52 
large in, 112 
larger topology, 72 
largest 

element, 34 
lattice, 37 

T{S) as a, 37 

^P(5') as an order complete, 37 
Banach, 163 
distributive, 38 
homomorphism, 39 
norm, 163 
vector, 101 
LAT 

bijective morphism in, 61 
law 

De Morgan's, 4, 40 
distributive, 2, 21 
of absorption, 38 
of trichotomy, 43 
least 

common refinement, 172 

element, 34 
least upper bound, 34 
Lebesgue 

-Radon-Nikodym theorem, 193 

decomposition theorem, 193 

dominated convergence theorem, 147 

integrable function, 147 

integral, 147 

measurable 
function, 140 
set, 138 



measure, 139 
outer measure, 138 
singular function, 130, 175 

as a Riesz space, 147 

family of Lebesgue integrable functions, 147 
jCoo{S) (essentially bounded functions), 140 
Lebesgue integral, see integral 
Li(K) 

as a commutative Banach algebra, 191 
U{S) 

as a Banach lattice, 163 
as a Banach space, 156 
as a normed linear space, 148 
as a Riesz space, 148 
U{S) 

as a Hilbert space, 160 
Loc(S') 

as a Banach algebra, 156 

as a Banach lattice, 163 

as a normed algebra, 141 

as a normed linear space, 140 

as a Riesz space, 141 
Lp(5) 

as a Banach lattice, 163 
as a Banach space, 157 
as a normed linear space, 156 
dual of, 197 
left 

cancellable, 29, 60 
ideal, 30 
inverse, 22 

of a function, 12 

of a morphism, 60 
length, 89 

lexicographic ordering, 33 

induced by a positive cone in R'^, 69 
liminf, 53 
limit 

inferior 

of a sequence of real numbers, 55 
of a sequence of sets, 53 
of a net, 119 

of a sequence in a metric space, 79 

of a sequence of measurable function, 112 

of a sequence of real numbers, 55 

of a sequence of sets, 53 

point, 79 

pointwise, 90 

right-hand, 171 

superior 

of a sequence of real numbers, 55 
of a sequence of sets, 53 
uniform, 90 
limsup, 53 
Lindelof, 201 
linear, 26 

combination, 35 

trivial, 35 
conjugate, 209 
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dependence, independence, 36 
functional, 28 

bounded, 96 

continuous, 96 

positive, 105 
map, 26 

order bounded, 106 

positive, 105 

regular, 105 
operator, 26 
ordering, 33 
span 

closed, 161 
transformation, 26 
£(!/) 

Eis a unital algebra, 29 
linear operators on V , 26 
£(y, W) 

linear maps between vector spaces, 26 
2b{V,W) 

order bounded linear maps, 106 
linearly ordered set, 33 

£^r(.V,W) 

regular linear maps, 105 
Liouville's theorem, 223 
locally 

compact, 130 

constant, 189 
lower 

bound, 34 

semicontinuous, 165 
Lp norm, 156 
^1 

as a normed linear space, 92 

as a normed linear space, 90 
Lusin's theorem, 204 

Ai^ (multiplication operator), 210 
majorize, 34, 105 
map, 8 

adjoint, 113 

linear, 26 

morphism, 112 

object, 112 
mapping, 8 
matrix, 23 
M„ 

as a Banach algebra, 156 

as a unital algebra, 29 

as a vector space, 25 

set of n X n matrices, 23 
maximal 

element, 34 

ideal, 30 

space, 225, 226 
meager, 215, 216 
mean 

convergence in, 203 
mean value theorem for integrals 

first, 180 



second, 180 
measurable 

decomposition, 143 
function, 111, 140 
Lebesgue, 138 
partition, 143 
rectangle, 187 
space, 109 
spaces 

coproduct of, 187 

product of, 187 
subset, 109 

with respect to an outer measure, 137 
M{S) 

as a Riesz space, 112 
as an algebra, 112 

real valued measurable functions on S, 111 
M{S,T) (measurable functions between measurable 

spaces). 111 
measure 
Borel, 194 

inner regular, 194 

outer regular, 194 

regular, 194 
complex, 115 
convergence in, 203 
counting, 116, 135 
Dirac, 116 

induced on a measurable subset, 139 

Lebesgue, 139 

negative set for a, 124 

outer, 137 

positive, 115 

positive set for a, 124 

probability, 181 

induced by a random variable, 184 
product, 188 
Radon, 197 
real, 115 
signed, 115 
space 

complete, 189 

completion of a, 190 

complex, 115 

positive, 115 

product, 188 

real, 115 

signed, 115 
meet, 38 
member, 1 
metric, 75 
bounded, 78 
discrete, 75 
Euclidean, 76 
induced by a norm, 89 
on C, 49 
space, 75 

completion of a, 153 
taxicab, 76 
uniform, 78 
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on R", 76 
usual 
on R, 75 
on R", 76 
metric space 
R" as a 

with the Euclidean metric, 76 
with the taxicab metric, 76 
with the uniform metric, 76 
complete, 151 
metrics 

equivalent, 77 
strongly equivalent, 77 
metrizable, 131 
minimal 

element, 34 
minimizing vector theorem, 161 
Minkowski inequality, 48 
Minkowski's inequality, 157 
minorize, 34 

DJlx (Lebesgue measurable subsets of R), 139 

m-measurable, 137 

m-nuU set, 138 

model, 44 

monic, 60 

monoid, 22 

homomorphism, 24 
monomorphism, 60 
monotone 

class, 110 

convergence 

of sequences, 104 
theorem, 146 

function, 171 

piecewise, 173 

sequence, 104 

sequence of real numbers, 57 

sequence of sets, 54 
morphism 

map, 112 

of a category, 59 

universal, 153 
multiplication 

operator, 118 

pointwise, 29 
multiplication operator 

adjoint of a, 210 

on L2{S), 210 
multiplicative linear functional, 224 
mutually disjoint, 2 
mutually exclusive, 181 
mutually separated, 199 
mutually singular 

elements of a Riesz space, 123 

measures, 124 
M{X) 

as a Banach space, 198 

as a normed linear space, 194 

as a Riesz space, 194 

regular Borel measures on X, 194 



N (set of natural numbers), 5 
natural 

equivalence, 195 

the Radon map as a, 198 
injection 

as a functor, 196 
for Banach spaces, 196 
map 

as a functor, 195 
for vector spaces, 195 
transformation, 195 

the Radon map as a, 196 
negative 

part (of a Riesz space vector), 101 

set (for a measure), 124 
neighborhood, 74 

base for, 120 

basic, 120 

of radius e in R, 57 
net, 119 

bounded, 121 

cluster point of a, 119 

convergence of a, 119 

decreasing, 122 

increasing, 121, 122 

limit of a, 119 
nilpotent, 224 
NLSoo 

products in, 94 
N„ (first n natural numbers), 5 
nonmeager, 216 
nonmeasurable set 

existence of, 139 
nontrivial 

vector space, 25 
norm, 89 

1- (of a sequence), 92 

1- (on R"), 90 

1- (on the product of normed spaces), 94 

2- (on the product of normed spaces), 94 
dual, 96 

essential supremum, 141 
Euclidean 
on R", 89 

on the product of normed spaces, 94 
L-infinity, 141 
lattice, 163 
Lp, 156 

of a bounded linear map, 94 

of a bounded sesquilinear functional, 209 

product, 94 

Riesz, 163 

uniform 

on R", 90 

on ioo, 90 

on the product of normed spaces, 94 
on B{S), 90 
on bIs, V), 90 
usual 

on R", 89 
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variation, 173 
normal 

element in a '-algebra, 228 

topological space, 166 
N(A) (normal elements in a '-algebra, 228 
normed algebra, 95 

B(S) as a, 95 

CbiX) as a, 95 
as a, 95 
(5) as a, 141 

5im(S') as a, 144 

unital, 95 
normed linear space, 89 

R" as a, 89, 90 

ba(5) as a, 91, 125 

ca(S) as a, 125 

B{S) as a, 90 

B{S, V) as a, 90 

CbiX) as a, 95 

BV{[a,b]) as a, 173 

Loo (5) as a, 140 

Lp(S') as a, 156 

'B{V,W) as a, 94 

M(X) as a, 194 

h as a, 92 

Zoo as a, 90 

BVo{[a,b]) as a, 174 

direct sum, 94 

Li(S') as a, 148 

product, 94 

quotient, 96 
normed Riesz space, 163 

normed vector space, see normed linear space 
NLSoo 

duality functor on, 113 

the category, 92 
NLSi 

the category, 92 
norms 

equivalent, 92 
nowhere dense, 215 
null 

set, 2 

with respect to an outer measure, 138 

space, 27 
(empty set, null set), 2 
numbers 

complex, 49 

extended real, 55 

real, 43 

special sets of, 5 

object 

map, 112 
object (of a category), 59 
0{V,W) ("big-oh" functions), 97 
o{V,W) ("little-oh" functions), 97 
1-norm 

of a sequence, 92 

on R", 90 

on the product of normed spaces, 94 



one-point compactification, 132 
one-to-one, 11 

correspondence, 12 
onto, 11 
open 

ball, 75 
unit, 75 

cover, 127 

disk, 20 

map, 87 

mapping, 218 
theorem, 218 

set, 71 
basic, 73 

unit disc, 5 
operation 

binary, 21 

unary, 21 
operator, 93 

integral, 210 

adjoint of an, 210 

linear, 26 

multiplication, 118 

adjoint of a, 210 

on 1/2(5'), 210 
projection, 220 
topology 

strong, 222 

weak, 222 
unilateral shift, 210 

adjoint of, 210 
opposite of a partially ordered set, 103 
order 
bounded 

linear map, 106 

set, 106 
complete, 34 
continuity 

sequential, 116 
dual, 106 
ideal, 117 

generated by a set, 118 

principal, 118 
interval, 104 
isomorphism, 35 
preserving, 35 
reversing, 35 
ordered 
pair, 7 

vector space, 67 

R2 as a, 68, 69 

R" as a, 67 

ba(S, 21) as a, 70 

T{S) as a, 67 

s as a, 67 
ordering 

coordinatewise, 33 
lexicographic, 33, 69 
linear, 33 
on R 
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induced by P, 43 
partial, 33 
pointwise, 33 
pre-, 33 
total, 33 
usual 

on J^{S), 67 

on 68 
well-, 45 
orthogonal, 160 
complement, 161 
direct sum, 162 
elements of a Riesz space, 123 
projection, 231 
orthonormal, 208 
basis, 208 
set 

complete, 208 
oscillation. 111 
outcomes, 181 

equUy likely, 181 
outer 

measure, 137 

Lebesgue, 138 
regular 

Borel measure, 194 
output space, 8 

pairwise disjoint 

family of sets, 2 

indexed family of sets, 53 
parallelogram law, 160 
Parseval's identity, 208 
part 

imaginary, 49 

negative, 101 

positive, 101 

real, 49 
partial 

ordering, 33 

induced by a proper convex cone, 68 

sum, 121, 152 
partially ordered set, 33 

J^{S) as a, 33 

power set as a, 33 
partially ordered set,dual of a, 103 
partially ordered set,opposite of a, 103 
POSET 

bijective morphism in, 61 

the category, 61 
partition 

measurable, 143 

of a set, 14 

of an interval, 172 

with selection, 120, 177 
■p(J) (partitions of an interval), 177 
path, 200 

path connected, 200 

perfect set, 130 

perpendicular, 160 

tta (coordinate projections), 64 



piecewise 

monotone, 173 
Pm (projection onto a Hilbert subspace), 207 
point, 1 

accumulation, 79 

at infinity, 132 

interior, 79 

isolated, 130 

limit, 79 
pointwise 

addition, 23 

bounded, 155, 221 

convergence, 90 

topology of, 85, 122 

limit, 90 

multiplication, 29 

ordering, 33 
polarization identity, 161 
polynomial 

trigonometric, 208 
positive 

cone 
in R, 43 

in an ordered vector space, 68 

element (of an ordered vector space) , 68 

function, 69 

linear functionals, 105 

linear map, 105 

measure, 115 
space, 115 

part (of a Riesz space vector), 101 

real finitely additive set function, 69 

real numbers, 43 

set (for a measure), 124 

strictly, 43 
P (positive cone in R), 43 
power set (of a set), 2 

as a Boolean algebra, 40 

as a distributive lattice, 38 

as a Boolean ring, 30 

as a partially ordered set, 33 

as a ring under A and n, 28 

as an Abelian group under A, 23 

as an order complete lattice, 37 

power set of 5, 2 
pre-annihilator, 214 
preordered set, 33 
preordering, 33 
principal 

band, 119 

ideal, 30 

order ideal, 118 
principle 

good sets, 110 

of uniform boundedness, 221 
probability 

as a measure of conviction, 182 
conditional, 182 
distribution, 184 
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measure, 181 

induced by a random variable, 184 
relative frequency interpretation of, 182 
space, 181 

discrete, 181 
product, 28 

Cartesian, 7, 64 

complete, 64 

direct, 64 

in BALG, 158 

in NLSoo, 94 

in SET, 63, 64 

in TOP, 85 

in VEC, 64 

in a category, 63, 64 

inner, 159 

measure, 188 

measure space, 188 

norm, 94 

of measurable spaces, 187 

of normed linear spaces, 94 

topology, 85 

uniqueness of, 63 
progression 

arithmetic, 74 
projection 

in a Banach space, 220 

onto coordinates, 64 

orthogonal, 231 
proper 

cone, 68 

containment, 2 

ideal, 30 

subset, 2 
property 

Archimedean 

of the real numbers, 44 

cancellation, 29 

intrinsic, 85 
propositional calculus, 40 

as a Boolean algebra, 40 
propositions, 40 
pseudometric, 75 
Pythagorean theorem, 160 

Q (set of rational numbers), 5 
quotient, 14, 40 
algebra, 63 
group, 61 
maps, 61 

in AbGp, 61 

in BALG, 169 

in BAN, 218 

in RSP, 118 

in RS, 118 

in SET, 61 

in TOP, 87 

in VEC, 62 
normed linear space, 96 
object, 61 
Riesz space, 118 



topology, 86 

vector space, 62 
quotient map, the, 14, 63 
quotient theorem, fundamental 

for AbGp, 62 

for BALG, 219 

for NLSt^, 96 

for RSP, 118 

for RS, 118 

for TOP, 87 

for VEC, 63 

for SET, 15 

R (set of real numbers), 5 
radius 

spectral, 224 
Radon 

map, 196 

as a natural equivalence, 198 
as a natural transformation, 196 

measure, 197 
Radon-Nikodym 

derivative, 146, 193 

theorem, 193 
random, 182 

variable, 184 
range, 9 

essential, 141 
ran / (range of a function /) , 9 
rare, 215 

rca(X) (regular Borel measures on X), 194 
real 
line 

special subsets of, 5 

usual metric on, 75 

usual topology on, 73 
measure, 115 

space, 115 
numbers 

axioms for, 43 

construction of the, 44 

definition of, 43 

extended, 55 

field properties of, 43 

positive, 43 
part of a complex number, 49 
valued function, 8 

bounded, 26 
vector space, 25 
R (extended real numbers), 55 
R2 

as a Riesz space 

with its usual ordering, 101 
with lexicographic ordering, 101 

as an ordered vector space 
ordered lexicographically, 69 
with its usual ordering, 68 
with non-generating cone, 69 

R" 

as a metric space 

with the Euclidean metric, 76 
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with the taxicab metric, 76 
with the uniform metric, 76 

as a normed hnear space, 89, 90 

as a vector space, 25 

as an ordered vector space, 67 

is the set of n-tuples of real numbers, 5 

ordered coordinatewise, 33 

ordered lexicographically, 33 
rectangle 

measurable, 187 
refinement, 120, 172 

least common, 172 
refinement (of a partition), 177 
reflexive 

Banach space, 214 

ordering, 8 
regular 

Borel measure, 194 

linear map, 105 

topological space, 132 
relation, 7 

equivalence, 14 
relative 

complement, 4 

frequency interpretation, 182 
relatively 

compact, 156 

consistent, 44 
representative, 14 
residual, 216 
resolvent mapping, 223 
respects operations, 67 
restriction, 10 
retract, 168 
retraction, 168 
Riemann 

condition, 179 

integrable, 120 

integral, 120, 178 

vs. Lebesgue integral, 149 

sum, 120 
Riemann-Stieltjes 

integrable, 178 

integral, 178 

as a Riemann integral, 178 
change of variables in, 178 

sum, 177 

TZ{a) (functions R-S integrable with respect to a), 

178 
Riesz 

decomposition lemma, 103 
homomorphism, 104 
isomorphism, 105 
norm, 163 

representation theorem, 196-198 
subspace, 103 
Riesz space, 101 
Archimedean, 104 
ba(5) as a, 102 
B{S) as a, 101 



BVo{[a,b]) as a, 174 
BV{[a,b]) as a, 173 
ca(5') as a, 117 
C{X) as a, 101 
T{S) as a, 101 
£i(/i) as a, 147 
Li{S) as a, 148 
(S) as a, 141 
M{S) as a, 112 
M(X) as a, 194 
normed, 163 
quotient, 118 
R as a, 101 
R^as a 

with its usual ordering, 101 

with lexicographic ordering, 101 
5im(S) as a, 144 
Riesz-Frechet theorem, 161 
RSP 

bijective morphisms in, 105 
quotients in, 118 
the category, 105 
RS 

bijective morphisms in, 105 

quotients in, 118 

the category, 104 
right 

cancellable, 29, 60 

ideal, 30 

inverse, 22 

of a function, 12 
of a morphism, 60 
right-continuous, 171 
right-hand limit, 171 
ring, 28 

Boolean, 30 

commutative, 28 

homomorphism, 30 
unital, 30 

unital, 28 

S (unilateral shift operator), 210 

(unit circle), 5 
sample space, 181 
scalar, 25 

multiplication, 25 
Schwarz inequality, 48, 160 
second 

category, 216 

dual 

functor for Banach spaces, 195 
functor for finite dimensional vector spaces, 195 
second countable, 74 
section 

horizontal, 187 

vertical, 187 
segment, closed, 68 
selection, 120, 177 
self-adjoint, 228 

family of functions, 165 

subset, 230 
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semicontinuous 

functions, interpolation between, 167 

lower, 165 

upper, 165 
semigroup, 21 
seminorm, 89 
separable, 201 
separated 

mutually, 199 
separating family, 131 
separation 

by functions, 131 

by open sets, 74 
separation axiom, 132 

Ti, 132 

T2, 132 

Ts, 132 

Ti, 166 

T31, 132 

completely regular, 132 
Hausdorff, 74 
normal, 166 
regular, 132 
Tychonoff, 132 
s, s(R) 

as an ordered vector space, 67 

set of sequences of real numbers, 67 
sequence, 51 

bounded, 51, 78 

Caucliy, 151 

convergence of, 79 

decreasing, 57, 104 

increasing, 57, 104 

limit of a, 79 

monotone, 57, 104 

weak Cauchy, 222 

weak convergence of a, 222 
sequences 

monotone convergence of, 104 
sequential 

compactness, 127 

covering class, 136 

order continuity, 116 
series 

infinite, 121, 152 

sum of a, 121, 152 
sesquilinear functional, 209 

bounded, 209 
set, 1 

Cantor, 129 

Cantor-like, 130 

difference, 4 

directed, 119 

membership in a, 1 

perfect, 130 

subtraction, 4 

underlying, 60 
set function 

finitely additive, 69 
bounded, 69 



positive, 69 
SET 

bijective morphism in, 61 

coproducts in, 65 

products in, 63, 64 

quotients in, 61 

the category, 59 
Sierpinski topological space, 72 
c"s(t?) (cr-algebra generated by 5), 109 
(j-algebra, 109 
(T-finite measure space, 136 
a-order continuous, 116 
a(o) (the spectrum of a), 141 
"l^iei (Hilbert space sum), 207 
(T-Dedekind complete, 34 
signed 

measure, 115 
space, 115 
simple 

function, 143 

standard form for, 143 
5im(5') 

as a Banach lattice, 163 

as a normed algebra, 144 

as a Riesz space, 144 

as an algebra, 144 

simple functions, 143 
singular 

function, 175 

mutually, 123, 124 
small in, 112 
smaller topology, 72 
smallest 

doohickey of type X, 35 

element, 34 
solid, 117 

Sorgenfrey line, 201 
space 

Baire, 217 

Banach, 156 

carrier, 225 

character, 225 

complete measure, 189 

complete metric, 151 

complex measure, 115 

Hilbert, 159 

input, 8 

maximal ideal, 225, 226 
measurable, 109 
metric, 75 
normed linear, 89 
normed Riesz, 163 
output, 8 

positive measure, 115 
probability, 181 
discrete, 181 
product measure, 188 
real measure, 115 
sample, 181 
Sierpinski, 72 
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signed measure, 115 

target, 8 

topological, 71 
span, 35 

of the empty set, 36 
spectral 

mapping theorem, 231 

radius, 224 

formula for, 224 

theorem, 231 
spectrum, 141 
sphere, 75 

unit, 75 

Sr{a) (sphere of radius r about a), 75 
square integrable, 160 
square summable, 159 
standard basis vectors, 27 
standard deviation, 185 
standard form 

for a simple function, 143 
"-algebra, 227 
*-homomorphism, 227 
*-subalgebra, 230 
Stone-Cech compactification, 132 
Stone- Weierstrass theorem, 164, 165 

generalized, 165 
strict inequahty, 56 
strictly 

decreasing, 171 

increasing, 171 

monotone, 171 

positive, 43 
strong 

convergence 

of Hilbert space operators, 222 

equivalence 

induces identical topologies, 78 
of metrics, 77 

operator topology 

sequential convergence in the, 222 

topology, 86 
stronger topology, 72 
subadditive 

countably, 69, 135 
subalgebra, 30 

unital, 30 
subbase, 82 
sub cover, 127 
subgroup, 23 
sublinear, 213 
submultiplicative, 95 
subsequence, 58 
subset, 2 

proper, 2 
C (subset of), 2 
subspace 

complemented, 220 

of a Banach space, 161 

of a topological space, 73, 85 

of a vector space, 26 



Riesz, 103 
successor, 45 
sufficiently large, 52 
sum 

direct, 64, 65 

of a summable set, 121 

of an infinite series, 121, 152 

partial, 121, 152 

Riemann, 120 

Riemann-Stieltjes, 177 
summable, 92, 121, 207 

absolutely, 92 

square, 159 
sup (supremum of a set), 34 
support 

of a function, 65 
supremum, 34 

essential, 140 
surjective, 11 
symmetric, 8 
symmetric difference, 4 

associativity of, 4 

T (unit circle), 5 
Ti, 132 
Ta, 132 
Ts, 132 

Tgl, 132 

T4, 166 

tangent (at zero), 98 
target space, 8 

tb (natural injection of B into B**), 196 

taxicab metric, 76 

term 

of a sequence, 51 
ternary expansion, 130 
Tietze extension theorem 

complex version, 169 

real version, 168 
Tonelfi's theorem, 190 
Tong theorem, 167 
topological 

category, 92 

dual, 96 
topological space, 71 

connected, 199 

disconnected, 199 

first countable, 201 

Hausdorflt, 74 

Lindelof, 201 

second countable, 74 

separable, 201 

subspace of a, 73 
topology, 71 

Banach space weak, 215 

Banach weak star, 215 

cofinite, 72 

discrete, 72 

Gelfand, 225 

indiscrete, 72 

induced by a metric, 77 
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larger, stronger, finer, 72 

of pointwise convergence, 85, 122 

base for, 85 
product, 85 
quotient, 86 
Sierpinski, 72 

smaller, weaker, coarser, 72 
strong, 86 

operator, 222 
usual 

on 1, 73 

on K, 73 
weak, 84 

operator, 222 
TOP 

bijective morphisms in, 82 

coproducts in, 85 

products in, 85 

quotient maps in, 87 

quotients in, 86 

the category, 81 
total 

boundedness, 152 

ordering, 33 

orthonormal set, 208 

variation, 125, 172 
totally 

disconnected, 201 
transcendental number, 20 
transformation, 8 

linear, 26 

natural, 195 
transitive, 8 
translation, 91 

invariant, 139 
transpose 

conjugate, 227 
triangle inequality, 47, 75 
trichotomy, 43 

trigonometric polynomial, 208 
trivial 

Boolean algebra, 40 

linear combination, 35 
2-norm, 94 

two-point duplication property, 164 
TychonofT 

topological space, 132 

unary operation, 21 
uncountable, 19 
underlying set, 60 
uniform 

boundedness, 155 
principle of, 221 

continuity, 83 

convergence, 90 

equicontinuity, 155 

limit, 90 

metric, 78 
on R", 76 

norm 



of a sequence, 92 

on the product of normed spaces, 94 
on B(5), 90 
on bIs,V), 90 
on K", 90 
on Zoo, 90 
uniformly 

bounded, 221 
unilateral shift, 210 

adjoint of, 210 
union, 2 

disjoint, 3, 65 
W, [+1 (disjoint union), 3, 65 
uniqueness 
essential, 63 
of products, 63 
unit 

closed ball, 75 
open ball, 75 
sphere, 75 
unit vector, 89 
unital 

*-homomorphism, 227 
algebra, 29 

algebra homomorphism, 31 
normed algebra, 95 
ring, 28 

ring homomorphism, 30 
subalgebra, 30 
unitary, 228 

U {A) (unitary elements in a *-algebra, 228 
universal 

morphism, 153 

object, 153 

uniqueness of, 155 

property, 153 
upper 

bound, 34 

semicontinuous, 165 
Urysohn's lemma 

for locally compact Hausdorff spaces, 131 
for normal spaces, 169 
usual 

metric on R, 75 
metric on R", 76 
norm 

on R", 89 
ordering 

on J"(5), 67 

on R^ 68 
orthonormal basis for I2, 208 
topology onR, 73 

vanish at infinity, 156 

Vf (variation function of /), 174 

variable 

random, 184 
variance, 185 
variation 

bounded, 172 

function, 174 
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norm, 173 
total, 125, 172 



divisor, 29 



vector, 25 

decomposition theorem, 162 
lattice, 101 
unit, 89 



vector space, 25 
zero set, 158 
zero-dimensional, 201 
Zf (zero set of /), 158 
Zorn's lemma, 35 



vector space, 25 
complex, 25 
coproduct, 65 
direct sum, 64 
factor, 62 
nontrivial, 25 
normed, 89 
ordered, 67 
product, 64 
quotient, 62 
real, 25 

subspace of, 26 
zero, 25 
VEC 

bijective morphism in, 27 
coproducts in, 65 
products in, 64 
quotients in, 62 
the category, 59 
vertical section, 187 

w-bounded, 221 

TO*-topology (weak star topology), 215 
weak 

boundedness 

in a normed linear space, 221 

of sequences of Hilbert space operators, 222 
Cauchy sequence, 222 
convergence 

of Hilbert space operators, 222 

of sequences in a normed linear space, 222 
inequality, 56 
operator topology 

boundedness in, 222 

sequential convergence in the, 222 
topology, 84 

on a Banach space, 215 
weaker topology, 72 

Weierstrass approximation theorem, 164, 165 
well-defined, 15 
well-ordered set, 45 
word 

finite, 20 

infinite, 20 

Z (set of integers), 5 

as a Boolean algebra, 40 

as a Boolean ring, 30 

as a field, 29 

as a ring, 28 

as an Abelian group, 23 
Zermelo's postulate, 13 
zero 

bounded away from, 173 



